Red Hat AI

Enterprise

company

Verified

https://www.redhat.com/en/products/ai

RedHat_AI

RedHatOfficial

Activity Feed

AI & ML interests

OpenSource and AI

Recent Activity

nm-research updated a collection 1 day ago

NVFP4 Models

alexmarques updated a collection 2 days ago

Speculator Models

MeganEFlynn updated a model 2 days ago

RedHatAI/Qwen3-235B-A22B-speculator.eagle3

View all activity

Organization Card

Community About org cards

Red Hat AI Build AI for your world

The Red Hat AI repository on Hugging Face is an open-source initiative backed by deep collaboration between IBM and Red Hat’s research, engineering, and business units. We’re committed to making AI more accessible, efficient, and community-driven from research to production.

We believe the future of AI is open. That’s why we’re sharing our latest models and research on Hugging Face, which are freely available to help researchers, developers, and organizations deploy high-performance AI at scale.

🔧 With Red Hat AI, you can:

Use or build optimized foundation models, including Llama, Mistral, Qwen, Gemma, DeepSeek, and others, tailored for performance and accuracy in real-world deployments.
Customize and fine-tune models for your workflows, from experimentation to production, with tools and frameworks built to support reproducible research and enterprise AI pipelines.
Maximize inference efficiency across hardware using production-grade compression and optimization techniques like quantization (FP8-dynamic, INT8, INT4), structured/unstructured sparsity, distillation, and more, ready for cost-efficient deployments with vLLM.
Validated models by Red Hat AI offer confidence, predictability, and flexibility when deploying third-party generative AI models across the Red Hat AI platform. Red Hat AI validates models by running a series of capacity planning scenarios with GuideLLM for benchmarking, Language Model Evaluation Harness for accuracy evaluations, and vLLM for inference serving across a wide variety of AI acclerators.

🔗 Explore relevant open-source tools:

vLLM – Serve large language models efficiently across GPUs and environments.
LLM Compressor – Compress and optimize your own models with SOTA quantization and sparsity techniques.
Speculators – Build, evaluate, and store speculative decoding algorithms for LLM inference in vLLM.
InstructLab – Fine-tune open models with your data using scalable, community-backed workflows.
GuideLLM – Benchmark, evaluate, and guide your deployments with structured performance and latency insights.

Or learn more about our full product suite at https://www.redhat.com/en/products/ai

Collections 17

View 17 collections

models 568

datasets 1

RedHatAI/speculator_benchmarks

Preview • Updated Nov 4, 2025 • 268

Red Hat AI

AI & ML interests

Recent Activity

Red Hat AI Build AI for your world

Collections 17

RedHatAI/Mistral-Large-3-675B-Instruct-2512

RedHatAI/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8

RedHatAI/Mistral-Large-3-675B-Instruct-2512-NVFP4

RedHatAI/Apertus-8B-Instruct-2509-FP8-dynamic

RedHatAI/Mistral-Small-3.2-24B-Instruct-2506-NVFP4

RedHatAI/Qwen3-VL-235B-A22B-Instruct-NVFP4

RedHatAI/Qwen3-235B-A22B-Instruct-2507-NVFP4

RedHatAI/Qwen3-235B-A22B-NVFP4

RedHatAI/Mistral-Large-3-675B-Instruct-2512

RedHatAI/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8

RedHatAI/Mistral-Large-3-675B-Instruct-2512-NVFP4

RedHatAI/Apertus-8B-Instruct-2509-FP8-dynamic

RedHatAI/Mistral-Small-3.2-24B-Instruct-2506-NVFP4

RedHatAI/Qwen3-VL-235B-A22B-Instruct-NVFP4

RedHatAI/Qwen3-235B-A22B-Instruct-2507-NVFP4

RedHatAI/Qwen3-235B-A22B-NVFP4

models 568

RedHatAI/Qwen3-235B-A22B-speculator.eagle3

RedHatAI/NVIDIA-Nemotron-Nano-9B-v2-quantized.w4a16

RedHatAI/NVIDIA-Nemotron-Nano-9B-v2-FP8-dynamic

RedHatAI/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8

RedHatAI/Voxtral-Mini-3B-2507-FP8-dynamic

RedHatAI/Qwen3-Coder-480B-A35B-Instruct-FP8

RedHatAI/Apertus-8B-Instruct-2509-FP8-dynamic

RedHatAI/Mistral-Large-3-675B-Instruct-2512-NVFP4

RedHatAI/Mistral-Large-3-675B-Instruct-2512

RedHatAI/Qwen3-Next-80B-A3B-Thinking-FP8-dynamic

datasets 1

RedHatAI/speculator_benchmarks

AI & ML interests

Recent Activity

Team members 114

Red Hat AI Build AI for your world

Collections 17

models 568 Sort: Recently updated

datasets 1

models 568