📢 NVIDIA Releases Nemotron-CC-Math Pre-Training Dataset: A High-Quality, Web-Scale Math Corpus for Pretraining Large Language Models 24 days ago • 2
NVIDIA Releases Improved Pretraining Dataset: Preserves High Value Math & Code, and Augments with Multi-Lingual 24 days ago • 2
NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks about 1 month ago • 73
Llama-NeMoRetriever-ColEmbed: Developer-Focused Guide to NVIDIA's State-of-the-Art Text-Image Retrieval Jul 9 • 4
Nemotron-Personas: Improve AI Training With the First Synthetic Personas Dataset Aligned to Real-World Distributions Jun 10 • 16
NVIDIA Nemotron Open, Production-ready Enterprise Models. NVIDIA Open Model License. nvidia/NVIDIA-Nemotron-Nano-12B-v2 Text Generation • 12B • Updated 1 day ago • 37.1k • 69 nvidia/NVIDIA-Nemotron-Nano-9B-v2 Text Generation • 9B • Updated 12 days ago • 90.5k • 335 nvidia/NVIDIA-Nemotron-Nano-9B-v2-Base Text Generation • 9B • Updated 15 days ago • 3.28k • 34 nvidia/NVIDIA-Nemotron-Nano-12B-v2-Base Text Generation • 12B • Updated 15 days ago • 3.06k • 73
Nemotron-Pre-Training-Dataset nvidia/Nemotron-Pretraining-Dataset-sample Viewer • Updated 16 days ago • 27.7k • 1.82k • 18 nvidia/Nemotron-CC-Math-v1 Viewer • Updated 9 days ago • 145M • 9.9k • 39 nvidia/Nemotron-CC-v2 Viewer • Updated 16 days ago • 5.81B • 63.1k • 70 nvidia/Nemotron-Pretraining-SFT-v1 Viewer • Updated 16 days ago • 358M • 3.87k • 20
OpenReasoning-Nemotron Collection of models for OpenReasoning-Nemotron which are trained on 5M reasoning traces for Math, Code and Science. nvidia/OpenReasoning-Nemotron-1.5B Text Generation • 2B • Updated 13 days ago • 2.75k • 43 nvidia/OpenReasoning-Nemotron-7B Text Generation • 8B • Updated 13 days ago • 4.7k • 42 nvidia/OpenReasoning-Nemotron-14B Text Generation • 15B • Updated 13 days ago • 1.88k • 39 nvidia/OpenReasoning-Nemotron-32B Text Generation • 33B • Updated 13 days ago • 2.65k • 112
Reward Models Nemotron reward models. For use in RLHF pipelines and LLM-as-a-Judge nvidia/Llama-3_3-Nemotron-Super-49B-GenRM Text Generation • 50B • Updated Jun 26 • 150 • 16 nvidia/Llama-3_3-Nemotron-Super-49B-GenRM-Multilingual Text Generation • 50B • Updated Jun 26 • 296 • 6 nvidia/Llama-3.3-Nemotron-70B-Reward Text Generation • 71B • Updated Jun 26 • 392 • 2 nvidia/Llama-3.3-Nemotron-70B-Reward-Multilingual Text Generation • 71B • Updated Jun 26 • 1.23k • 10
AceReason Math and Code reasoning model trained through reinforcement learning (RL) nvidia/AceReason-Nemotron-14B Text Generation • 15B • Updated Jun 17 • 12.7k • 91 nvidia/AceReason-Nemotron-7B Text Generation • 8B • Updated Jun 17 • 6.7k • 19 nvidia/AceReason-Nemotron-1.1-7B Text Generation • 8B • Updated Jul 11 • 7.37k • 56 nvidia/AceReason-Math Viewer • Updated Jun 18 • 49.6k • 1.7k • 33
Nemotron-H Mamba-Transformer hybrid models nvidia/Nemotron-H-47B-Reasoning-128K Text Generation • 47B • Updated Jul 11 • 747 • 18 nvidia/Nemotron-H-8B-Reasoning-128K Text Generation • 8B • Updated Jul 11 • 10.3k • 22 nvidia/Nemotron-H-8B-Reasoning-128K-FP8 Text Generation • 8B • Updated 21 days ago • 201 • 12 nvidia/Nemotron-H-47B-Reasoning-128K-FP8 Text Generation • 47B • Updated 21 days ago • 106 • 5
Describe Anything Multimodal Large Language Models for Detailed Localized Image and Video Captioning Describe Anything ⚡ Describe masked parts of images using prompts (Space) • 336 nvidia/DAM-3B Image-Text-to-Text • Updated May 7 • 6.47k • 127 nvidia/DAM-3B-Video Image-Text-to-Text • Updated May 7 • 933 • 56 nvidia/DAM-3B-Self-Contained Image-Text-to-Text • Updated May 7 • 6.17k • 23
OpenMathReasoning Models and datasets from "AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset" nvidia/OpenMathReasoning Viewer • Updated May 27 • 5.68M • 7.84k • 335 nvidia/OpenMath-Nemotron-1.5B Text Generation • 2B • Updated Apr 30 • 3.56k • 24 nvidia/OpenMath-Nemotron-7B Text Generation • 8B • Updated Apr 30 • 1.55k • 9 nvidia/OpenMath-Nemotron-14B Text Generation • 15B • Updated Apr 30 • 1.79k • 12
OpenCodeReasoning-II Reasoning data for supervised finetuning of LLMs to advance code generation and critique nvidia/OpenCodeReasoning-2 Viewer • Updated May 17 • 2.16M • 2.04k • 39 nvidia/OpenCodeReasoning Viewer • Updated May 4 • 753k • 2.74k • 492
Scoring Verifiers Benchmarks for evaluating synthetic verifiers like test case generation and code reward models (as found in https://www.arxiv.org/abs/2502.13820). nvidia/Scoring-Verifiers Updated Apr 1 • 56 • 7 Scoring Verifiers: Evaluating Synthetic Verification in Code and Reasoning Paper • 2502.13820 • Published Feb 19
Cosmos-Reason1 Multimodal world understanding through reasoning nvidia/Cosmos-Reason1-7B Image-Text-to-Text • 8B • Updated 28 days ago • 355k • 168 nvidia/Cosmos-Reason1-RL-Dataset Viewer • Updated May 20 • 892 • 417 • 11 nvidia/Cosmos-Reason1-Benchmark Viewer • Updated May 20 • 510 • 630 • 6 nvidia/Cosmos-Reason1-SFT-Dataset Viewer • Updated May 20 • 1.71M • 1.6k • 19
Cosmos-Tokenize1 A suite of image and video tokenizers nvidia/Cosmos-Tokenize1-CI8x8-360p Updated Mar 18 • 93 • 2 nvidia/Cosmos-Tokenize1-CI16x16-360p Updated Mar 18 • 84 • 1 nvidia/Cosmos-Tokenize1-CV4x8x8-360p Updated Mar 18 • 88 • 1 nvidia/Cosmos-Tokenize1-CV8x8x8-720p Updated Apr 23 • 3.27k • 3
Cosmos-Tokenizer A suite of image and video tokenizers nvidia/Cosmos-0.1-Tokenizer-CI8x8 Updated Nov 11, 2024 • 127 • 32 nvidia/Cosmos-0.1-Tokenizer-CI16x16 Updated Dec 25, 2024 • 107 • 8 nvidia/Cosmos-0.1-Tokenizer-DI8x8 Updated Dec 25, 2024 • 94 • 11 nvidia/Cosmos-0.1-Tokenizer-DI16x16 Updated Dec 25, 2024 • 84 • 9
Physical AI Collection of commercial-grade datasets for physical AI developers nvidia/PhysicalAI-SmartSpaces Updated 6 days ago • 37.1k • 44 nvidia/PhysicalAI-Robotics-Manipulation-Kitchen Viewer • Updated May 15 • 405k • 2.47k • 10 nvidia/PhysicalAI-Robotics-GraspGen Viewer • Updated Jun 21 • 25.5k • 672 • 22 nvidia/PhysicalAI-Robotics-Manipulation-SingleArm Updated May 15 • 14.2k • 12
Cosmos The collection of Cosmos models nvidia/Cosmos-1.0-Guardrail Updated Jun 11 • 1.29k • 56 nvidia/Cosmos-1.0-Autoregressive-4B Updated Feb 11 • 37 • 54
AceMath We are releasing math instruction models, math reward models, general instruction models, all training datasets, and a math reward benchmark. nvidia/AceMath-1.5B-Instruct Text Generation • 2B • Updated Jan 17 • 3.58k • 13 nvidia/AceMath-7B-Instruct Text Generation • 8B • Updated Jan 17 • 1.67k • 26 nvidia/AceMath-72B-Instruct Text Generation • 73B • Updated Jan 17 • 1.84k • 19 nvidia/AceMath-7B-RM Text Generation • 7B • Updated Jan 17 • 8.68k • 6
Eagle 2 Eagle 2 is a family of frontier vision-language models with vision-centric design. The model supports 4K HD input, long-context video, and grounding. nvidia/Eagle2-1B Image-Text-to-Text • 1B • Updated Apr 27 • 4.22k • 25 nvidia/Eagle2-2B Image-Text-to-Text • 2B • Updated Apr 27 • 1.31k • 29 nvidia/Eagle2-9B Image-Text-to-Text • 9B • Updated Jan 28 • 379 • 61
Hymba A series of Hybrid Small Language Models. nvidia/Hymba-1.5B-Instruct Text Generation • 2B • Updated Jan 2 • 436 • 235 nvidia/Hymba-1.5B-Base Text Generation • 2B • Updated Jan 2 • 493 • 149 Hymba: A Hybrid-head Architecture for Small Language Models Paper • 2411.13676 • Published Nov 20, 2024 • 46
NVLM 1.0 A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks. nvidia/NVLM-D-72B Image-Text-to-Text • 79B • Updated Jan 14 • 73.5k • 773 nvidia/NVLM-D-72B-mcore Image-Text-to-Text • Updated Jan 14 • 6
Nemotron 4 340B Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. nvidia/Nemotron-4-340B-Instruct Updated Jun 24, 2024 • 40 • 684 nvidia/Nemotron-4-340B-Reward Updated Jun 19, 2024 • 15 • 125 nvidia/Nemotron-4-340B-Base Updated Jun 28, 2024 • 24 • 146 nvidia/HelpSteer2 Viewer • Updated Dec 18, 2024 • 21.4k • 10.8k • 428
Parakeet NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. nvidia/parakeet-rnnt-1.1b Automatic Speech Recognition • Updated Feb 18 • 9.27k • 158 nvidia/parakeet-ctc-1.1b Automatic Speech Recognition • 1B • Updated Jul 29 • 2.87k • 33 nvidia/parakeet-rnnt-0.6b Automatic Speech Recognition • Updated Feb 18 • 175k • 10 nvidia/parakeet-ctc-0.6b Automatic Speech Recognition • Updated Aug 22, 2024 • 3.05k • 16
InstructRetro InstructRetro is an autoregressive decoder-only language model (LM) with retrieval-augmented pretraining and instruction tuning. nvidia/retro-48b-instruct-4k Text Generation • Updated May 29, 2024 • 20 nvidia/retro-8b-instruct-4k Text Generation • Updated May 29, 2024 • 14
RLHF A collection of models trained with Reinforcement Learning from Human Feedback (RLHF). nvidia/NV-Llama2-70B-RLHF-Chat Text Generation • Updated Mar 9, 2024 • 4 nvidia/NV-Llama2-13B-RLHF-RM Text Generation • Updated Mar 9, 2024 • 36 • 3 nvidia/sft_datablend_v1 Viewer • Updated Mar 9, 2024 • 128k • 50 • 14 nvidia/Daring-Anteater Viewer • Updated Jun 17, 2024 • 99.5k • 1.01k • 26
Llama3-ChatQA-1.5 Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). nvidia/Llama3-ChatQA-1.5-8B Text Generation • 8B • Updated May 24, 2024 • 11.6k • 554 nvidia/Llama3-ChatQA-1.5-70B Text Generation • 71B • Updated May 24, 2024 • 151 • 333 nvidia/ChatRAG-Bench Viewer • Updated May 24, 2024 • 34.6k • 1.05k • 113 nvidia/ChatQA-Training-Data Viewer • Updated Jun 4, 2024 • 442k • 614 • 172
Nemotron 3 8B The Nemotron 3 8B Family of models is optimized for building production-ready generative AI applications for the enterprise. nvidia/nemotron-3-8b-base-4k Text Generation • Updated Feb 9, 2024 • 4 • 91 nvidia/nemotron-3-8b-chat-4k-sft Text Generation • Updated Feb 9, 2024 • 10 nvidia/nemotron-3-8b-chat-4k-rlhf Text Generation • Updated Feb 9, 2024 • 6 • 27 nvidia/nemotron-3-8b-chat-4k-steerlm Text Generation • Updated Feb 9, 2024 • 2 • 22
MambaVision MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes both 1K and 21K pretrained models. nvidia/MambaVision-L3-512-21K Image Classification • 0.7B • Updated Mar 29 • 292 • 52 nvidia/MambaVision-L3-256-21K Image Classification • 0.7B • Updated Mar 29 • 67 • 7 nvidia/MambaVision-L2-512-21K Image Classification • 0.2B • Updated Mar 29 • 166 • 3 nvidia/MambaVision-L-21K Image Classification • 0.2B • Updated Mar 29 • 107 • 4
Minitron A family of compressed models obtained via pruning and knowledge distillation nvidia/Mistral-NeMo-Minitron-8B-Base Text Generation • 8B • Updated Aug 22, 2024 • 4.32k • 177 nvidia/Mistral-NeMo-Minitron-8B-Instruct Text Generation • 8B • Updated Oct 9, 2024 • 2.07k • 80 nvidia/Llama-3_1-Nemotron-51B-Instruct Text Generation • 52B • Updated Jul 6 • 9.52k • 210 nvidia/Llama-3.1-Minitron-4B-Width-Base Text Generation • 5B • Updated Feb 14 • 4.43k • 191
Llama3-ChatQA-2 ChatQA-2 is a suite of 128K long-context models that also have exceptional RAG capabilities. nvidia/Llama3-ChatQA-2-70B Text Generation • Updated Sep 10, 2024 • 131 • 12 nvidia/Llama3-ChatQA-2-8B Text Generation • Updated Sep 10, 2024 • 796 • 16 nvidia/ChatQA2-Long-SFT-data Viewer • Updated Sep 9, 2024 • 117k • 293 • 31
Llama Nemotron Open, Production-ready Enterprise Models nvidia/Llama-3_3-Nemotron-Super-49B-v1_5 Text Generation • 50B • Updated 21 days ago • 18.2k • 191 nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-FP8 Text Generation • 50B • Updated Jul 31 • 9.64k • 15 nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 Text Generation • 253B • Updated Jul 6 • 3.9k • 330 nvidia/Llama-3_3-Nemotron-Super-49B-v1 Text Generation • 50B • Updated May 30 • 23.7k • 319
BioNeMo Accelerated models for digital biology by the NVIDIA BioNeMo team. https://www.nvidia.com/en-us/clara/biopharma/ nvidia/AMPLIFY_350M Fill-Mask • 0.4B • Updated about 16 hours ago • 43 • 4 nvidia/AMPLIFY_120M Fill-Mask • 0.1B • Updated about 16 hours ago • 664 • 4 nvidia/esm2_t6_8M_UR50D Fill-Mask • 0.0B • Updated 14 days ago • 2.37k nvidia/esm2_t12_35M_UR50D Fill-Mask • 0.0B • Updated 14 days ago • 43
Cosmos-Predict2 World Foundation Model for Future Prediction nvidia/Cosmos-Predict2-0.6B-Text2Image Text-to-Image • Updated 10 days ago • 317 • 5 nvidia/Cosmos-Predict2-2B-Text2Image Text-to-Image • Updated Jun 17 • 892 • 63 nvidia/Cosmos-Predict2-2B-Video2World Image-to-Video • Updated Jul 23 • 1.64k • 30 nvidia/Cosmos-Predict2-14B-Text2Image Text-to-Image • Updated Jun 17 • 475 • 43
GEN3C 3D-Informed World-Consistent Video Generation with Precise Camera Control nvidia/GEN3C-Cosmos-7B Updated Jun 18 • 295 • 22 GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control Paper • 2503.03751 • Published Mar 5 • 23 nvidia/GEN3C-Testing-Example Viewer • Updated 20 days ago • 10 • 838 • 2
Model Optimizer A collection of generative models quantized and optimized with TensorRT Model Optimizer. nvidia/DeepSeek-R1-0528-FP4 Text Generation • Updated 20 days ago • 67k • 36 nvidia/DeepSeek-R1-FP4 Text Generation • Updated Jun 6 • 14.2k • 264 nvidia/Llama-3.3-70B-Instruct-FP4 41B • Updated 20 days ago • 28.3k • 22 nvidia/Llama-3.3-70B-Instruct-FP8 71B • Updated 20 days ago • 60.2k • 7
Cosmos-Embed1 Joint video-text embedding for physical AI nvidia/Cosmos-Embed1-224p 1B • Updated Jun 10 • 17.9k • 4 nvidia/Cosmos-Embed1-336p 1B • Updated Jun 10 • 1.03k nvidia/Cosmos-Embed1-448p 1B • Updated Jun 10 • 1.32k • 2 Cosmos Embed1 🚀 Cosmos-Embed1 demo app (Space)
AceMath-RL Math reasoning models trained through reinforcement learning (RL) nvidia/AceMath-RL-Nemotron-7B Text Generation • 8B • Updated Apr 23 • 2.75k • 23
OpenCodeReasoning Reasoning data for supervised finetuning of LLMs to advance data distillation for competitive coding nvidia/OpenCodeReasoning Viewer • Updated May 4 • 753k • 2.74k • 492 OpenCodeReasoning: Advancing Data Distillation for Competitive Coding Paper • 2504.01943 • Published Apr 2 • 16 nvidia/OpenCodeReasoning-Nemotron-7B Text Generation • 8B • Updated May 7 • 1.26k • 37 nvidia/OpenCodeReasoning-Nemotron-14B Text Generation • 15B • Updated May 7 • 547 • 18
Llama Nemotron Feedback-Edit Inference-Time Scaling Novel ITS approach for open-ended tasks - No. 1 on Arena Hard on 18 Mar 2025 nvidia/Llama-3.3-Nemotron-70B-Feedback Text Generation • 71B • Updated Mar 18 • 83 • 7 nvidia/Llama-3.3-Nemotron-70B-Edit Text Generation • 71B • Updated Mar 18 • 104 • 3 nvidia/Llama-3.3-Nemotron-70B-Select Text Generation • 71B • Updated Mar 18 • 1.24k • 10 nvidia/HelpSteer3 Viewer • Updated Jul 2 • 99k • 3.09k • 78
Nemotron-UltraLong nvidia/Llama-3.1-Nemotron-8B-UltraLong-1M-Instruct Text Generation • 8B • Updated Apr 17 • 6.12k • 51 nvidia/Llama-3.1-Nemotron-8B-UltraLong-4M-Instruct Text Generation • 8B • Updated Apr 17 • 5.46k • 119 nvidia/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct Text Generation • 8B • Updated Apr 17 • 1.18k • 15
Cosmos-Transfer1 Multimodal Conditional World Generation for World2World Transfer nvidia/Cosmos-Transfer1-7B Updated Jul 11 • 3.27k • 49 nvidia/Cosmos-Transfer1-7B-Sample-AV Updated Apr 9 • 1.52k • 14 nvidia/Cosmos-Transfer1-7B-Sample-AV-Data-Example Viewer • Updated Mar 19 • 130 • 71 • 6 nvidia/Cosmos-Transfer1-7B-4KUpscaler Updated Mar 20 • 56 • 6
Cosmos-Predict1 World Foundation Model for Future Prediction nvidia/Cosmos-Predict1-4B Updated Apr 8 • 95 • 2 nvidia/Cosmos-Predict1-5B-Video2World Updated Apr 8 • 82 • 3 nvidia/Cosmos-Predict1-7B-Text2World Updated Apr 8 • 186 • 4 nvidia/Cosmos-Predict1-7B-Video2World Updated Apr 8 • 127 • 2
Llama-3.1-Nemotron-70B SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. nvidia/Llama-3.1-Nemotron-70B-Instruct-HF Text Generation • 71B • Updated Apr 13 • 90.1k • 2.05k nvidia/Llama-3.1-Nemotron-70B-Reward-HF 71B • Updated Apr 13 • 8.18k • 88 nvidia/HelpSteer2 Viewer • Updated Dec 18, 2024 • 21.4k • 10.8k • 428 HelpSteer2-Preference: Complementing Ratings with Preferences Paper • 2410.01257 • Published Oct 2, 2024 • 25
QLIP QLIP is a family of image tokenizers with SOTA reconstruction quality and zero-shot image understanding. nvidia/QLIP-L-14-392 0.7B • Updated Feb 10 • 173 • 11 nvidia/QLIP-B-8-256 0.2B • Updated Feb 10 • 341 • 7 nvidia/QLIP-B-16-256 0.2B • Updated Feb 10 • 117 • 4
DMC LLMs equipped with Dynamic Memory Compression to accelerate generation. nvidia/Llama-2-7B-DMC-4x Updated Dec 22, 2024 • 1 nvidia/Llama-2-7B-DMC-8x Updated Dec 22, 2024 • 2 nvidia/Llama-2-13B-DMC-4x Updated Dec 22, 2024 • 1 nvidia/Llama-2-13B-DMC-8x Updated Dec 22, 2024 • 2
NemoGuard Essential datasets and models for content safety, topic-following, and security guardrails nvidia/Aegis-AI-Content-Safety-Dataset-2.0 Viewer • Updated Jun 9 • 33.4k • 3.24k • 47 nvidia/llama-3.1-nemoguard-8b-topic-control Text Classification • Updated Jun 9 • 2.28k • 16 nvidia/llama-3.1-nemoguard-8b-content-safety Text Classification • Updated Jun 9 • 446 • 25 nvidia/CantTalkAboutThis-Topic-Control-Dataset Viewer • Updated Jan 16 • 1.09k • 100 • 7
NeMo Audio Codecs A series of Neural Audio Codecs nvidia/low-frame-rate-speech-codec-22khz Feature Extraction • Updated Aug 5 • 195 • 17 nvidia/audio-codec-22khz Feature Extraction • Updated Aug 5 • 90 • 5 nvidia/audio-codec-44khz Feature Extraction • Updated Aug 5 • 438 • 21 nvidia/mel-codec-22khz Feature Extraction • Updated Aug 5 • 77 • 3
Optimized ONNX models for NVIDIA RTX GPUs Collection of optimized ONNX model checkpoints for NVIDIA RTX GPUs nvidia/Gemma-2b-it-ONNX-INT4 Updated Nov 15, 2024 • 8 nvidia/Meta-Llama-3.1-8B-Instruct-ONNX-INT4 Updated Nov 15, 2024 • 92 • 5 nvidia/Meta-Llama-3.2-3B-Instruct-ONNX-INT4 Updated Nov 15, 2024 • 8 nvidia/Mistral-7B-Instruct-v0.3-ONNX-INT4 Updated Nov 15, 2024 • 6
OpenMath-2 A collection of models and datasets introduced in "OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data" nvidia/OpenMath2-Llama3.1-8B Text Generation • 8B • Updated Nov 25, 2024 • 6.14k • 31 nvidia/OpenMath2-Llama3.1-70B Text Generation • 71B • Updated Nov 25, 2024 • 1.4k • 20 nvidia/OpenMathInstruct-2 Viewer • Updated Nov 25, 2024 • 22M • 13.7k • 199 nvidia/OpenMath2-Llama3.1-8B-nemo Updated Nov 25, 2024 • 6
SteerLM A collection of models and datasets relating to SteerLM and HelpSteer. nvidia/HelpSteer2 Viewer • Updated Dec 18, 2024 • 21.4k • 10.8k • 428 nvidia/Llama3-70B-SteerLM-RM Updated Jun 19, 2024 • 72 • 43 nvidia/Nemotron-4-340B-Reward Updated Jun 19, 2024 • 15 • 125 nvidia/HelpSteer Viewer • Updated Dec 18, 2024 • 37.1k • 2.39k • 241
Canary A collection of multilingual and multitask speech to text models from NVIDIA NeMo 🐤 nvidia/canary-1b Automatic Speech Recognition • Updated Apr 24 • 2.05k • 445 nvidia/canary-1b-flash Automatic Speech Recognition • 0.8B • Updated 22 days ago • 9.69k • 246 nvidia/canary-180m-flash Automatic Speech Recognition • Updated Mar 18 • 2.3k • 73 Training and Inference Efficiency of Encoder-Decoder Speech Models Paper • 2503.05931 • Published Mar 7 • 3
OpenMath A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset" nvidia/OpenMath-Mistral-7B-v0.1 Updated Feb 16, 2024 • 12 nvidia/OpenMath-Mistral-7B-v0.1-hf Text Generation • 7B • Updated Feb 16, 2024 • 129 • 33 nvidia/OpenMath-CodeLlama-7b-Python Updated Feb 16, 2024 • 2 • 2 nvidia/OpenMath-CodeLlama-7b-Python-hf Text Generation • 7B • Updated Feb 16, 2024 • 75 • 7
NV-Embed NV-Embed is a generalist embedding model covering retrieval, reranking, classification, clustering, and STS tasks. nvidia/NV-Embed-v1 8B • Updated Nov 30, 2024 • 4.1k • 426 nvidia/NV-Embed-v2 Feature Extraction • 8B • Updated Jul 21 • 112k • 461 nvidia/MM-Embed 8B • Updated Nov 6, 2024 • 2.39k • 60
SSMs A collection of Mamba-2-based research models with 8B parameters trained on 3.5T tokens for comparison with Transformers. nvidia/mamba2-8b-3t-4k Text Generation • Updated Jun 13, 2024 • 19 nvidia/mamba2-hybrid-8b-3t-128k Text Generation • Updated Jun 13, 2024 • 44 nvidia/mamba2-hybrid-8b-3t-32k Text Generation • Updated Jun 13, 2024 • 5 nvidia/mamba2-hybrid-8b-3t-4k Text Generation • Updated Jun 13, 2024 • 73
BigVGAN BigVGAN is a universal neural vocoder that generates audio waveform using mel spectrogram as input. BigVGAN 🔊 Generate high-quality audio from input audio (Space) • 106 nvidia/bigvgan_v2_44khz_128band_512x Audio-to-Audio • Updated Sep 5, 2024 • 683k • 54 nvidia/bigvgan_v2_44khz_128band_256x Audio-to-Audio • Updated Sep 5, 2024 • 677 • 7 nvidia/bigvgan_v2_24khz_100band_256x Audio-to-Audio • Updated Sep 5, 2024 • 16k • 17
PS3: Scaling Vision Pre-Training to 4K Resolution Enabling 4k resolution for VLMs, CVPR 2025, https://nvlabs.github.io/PS3/ nvidia/PS3-1.5K-SigLIP2 Image Feature Extraction • 1B • Updated Jul 30 • 193 • 1 nvidia/PS3-4K-SigLIP2 Image Feature Extraction • 1B • Updated Jul 30 • 159 • 1 nvidia/PS3_Lang-1.5K-SigLIP2 Image Feature Extraction • 0.5B • Updated Jul 30 • 107 • 1 nvidia/PS3_Lang-4K-SigLIP2 Image Feature Extraction • 0.6B • Updated Jul 30 • 36
RADIO A collection of Foundation Vision Models that combine multiple models (CLIP, DINOv2, SAM, etc.). nvidia/C-RADIOv3-B 0.1B • Updated Jun 25 • 37.2k • 3 nvidia/C-RADIOv3-L 0.3B • Updated Jul 30 • 3.61k • 1 nvidia/C-RADIOv3-H 0.7B • Updated Jun 25 • 179 • 1 nvidia/C-RADIOv3-g 1B • Updated Jul 30 • 999 • 6
NeMo Curator - Classifier Models Classifier models that can be used in NeMo Curator for labelling/filtering datasets. nvidia/domain-classifier Updated Jan 24 • 10.9k • 88 nvidia/quality-classifier-deberta Updated Jan 31 • 1.27k • 66 HuggingFaceFW/fineweb-edu-classifier Text Classification • 0.1B • Updated Nov 17, 2024 • 15.4k • 193 nvidia/Aegis-AI-Content-Safety-LlamaGuard-Defensive-1.0 Text Classification • Updated Jun 9 • 14k • 25
NVIDIA Nemotron Open, Production-ready Enterprise Models. Nvidia Open Model license. nvidia/NVIDIA-Nemotron-Nano-12B-v2 Text Generation • 12B • Updated 1 day ago • 37.1k • 69 nvidia/NVIDIA-Nemotron-Nano-9B-v2 Text Generation • 9B • Updated 12 days ago • 90.5k • 335 nvidia/NVIDIA-Nemotron-Nano-9B-v2-Base Text Generation • 9B • Updated 15 days ago • 3.28k • 34 nvidia/NVIDIA-Nemotron-Nano-12B-v2-Base Text Generation • 12B • Updated 15 days ago • 3.06k • 73
Llama Nemotron Open, Production-ready Enterprise Models nvidia/Llama-3_3-Nemotron-Super-49B-v1_5 Text Generation • 50B • Updated 21 days ago • 18.2k • 191 nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-FP8 Text Generation • 50B • Updated Jul 31 • 9.64k • 15 nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 Text Generation • 253B • Updated Jul 6 • 3.9k • • 330 nvidia/Llama-3_3-Nemotron-Super-49B-v1 Text Generation • 50B • Updated May 30 • 23.7k • 319
Nemotron-Pre-Training-Dataset nvidia/Nemotron-Pretraining-Dataset-sample Viewer • Updated 16 days ago • 27.7k • 1.82k • 18 nvidia/Nemotron-CC-Math-v1 Viewer • Updated 9 days ago • 145M • 9.9k • 39 nvidia/Nemotron-CC-v2 Viewer • Updated 16 days ago • 5.81B • 63.1k • 70 nvidia/Nemotron-Pretraining-SFT-v1 Viewer • Updated 16 days ago • 358M • 3.87k • 20
BioNeMo Accelerated models for digital biology by the NVIDIA BioNeMo team. https://www.nvidia.com/en-us/clara/biopharma/ nvidia/AMPLIFY_350M Fill-Mask • 0.4B • Updated about 16 hours ago • 43 • 4 nvidia/AMPLIFY_120M Fill-Mask • 0.1B • Updated about 16 hours ago • 664 • 4 nvidia/esm2_t6_8M_UR50D Fill-Mask • 0.0B • Updated 14 days ago • 2.37k nvidia/esm2_t12_35M_UR50D Fill-Mask • 0.0B • Updated 14 days ago • 43
OpenReasoning-Nemotron Collection of models for OpenReasoning-Nemotron which are trained on 5M reasoning traces for Math, Code and Science. nvidia/OpenReasoning-Nemotron-1.5B Text Generation • 2B • Updated 13 days ago • 2.75k • 43 nvidia/OpenReasoning-Nemotron-7B Text Generation • 8B • Updated 13 days ago • 4.7k • • 42 nvidia/OpenReasoning-Nemotron-14B Text Generation • 15B • Updated 13 days ago • 1.88k • 39 nvidia/OpenReasoning-Nemotron-32B Text Generation • 33B • Updated 13 days ago • 2.65k • • 112
Cosmos-Predict2 World Foundation Model for Future Prediction nvidia/Cosmos-Predict2-0.6B-Text2Image Text-to-Image • Updated 10 days ago • 317 • 5 nvidia/Cosmos-Predict2-2B-Text2Image Text-to-Image • Updated Jun 17 • 892 • 63 nvidia/Cosmos-Predict2-2B-Video2World Image-to-Video • Updated Jul 23 • 1.64k • 30 nvidia/Cosmos-Predict2-14B-Text2Image Text-to-Image • Updated Jun 17 • 475 • 43
Reward Models Nemotron reward models. For use in RLHF pipelines and LLM-as-a-Judge nvidia/Llama-3_3-Nemotron-Super-49B-GenRM Text Generation • 50B • Updated Jun 26 • 150 • 16 nvidia/Llama-3_3-Nemotron-Super-49B-GenRM-Multilingual Text Generation • 50B • Updated Jun 26 • 296 • 6 nvidia/Llama-3.3-Nemotron-70B-Reward Text Generation • 71B • Updated Jun 26 • 392 • 2 nvidia/Llama-3.3-Nemotron-70B-Reward-Multilingual Text Generation • 71B • Updated Jun 26 • 1.23k • 10
nvidia/Llama-3_3-Nemotron-Super-49B-GenRM-Multilingual Text Generation • 50B • Updated Jun 26 • 296 • 6
nvidia/Llama-3.3-Nemotron-70B-Reward-Multilingual Text Generation • 71B • Updated Jun 26 • 1.23k • 10
GEN3C 3D-Informed World-Consistent Video Generation with Precise Camera Control nvidia/GEN3C-Cosmos-7B Updated Jun 18 • 295 • 22 GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control Paper • 2503.03751 • Published Mar 5 • 23 nvidia/GEN3C-Testing-Example Viewer • Updated 20 days ago • 10 • 838 • 2
GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control Paper • 2503.03751 • Published Mar 5 • 23
AceReason Math and Code reasoning model trained through reinforcement learning (RL) nvidia/AceReason-Nemotron-14B Text Generation • 15B • Updated Jun 17 • 12.7k • • 91 nvidia/AceReason-Nemotron-7B Text Generation • 8B • Updated Jun 17 • 6.7k • • 19 nvidia/AceReason-Nemotron-1.1-7B Text Generation • 8B • Updated Jul 11 • 7.37k • • 56 nvidia/AceReason-Math Viewer • Updated Jun 18 • 49.6k • 1.7k • 33
Model Optimizer A collection of generative models quantized and optimized with TensorRT Model Optimizer. nvidia/DeepSeek-R1-0528-FP4 Text Generation • Updated 20 days ago • 67k • 36 nvidia/DeepSeek-R1-FP4 Text Generation • Updated Jun 6 • 14.2k • 264 nvidia/Llama-3.3-70B-Instruct-FP4 41B • Updated 20 days ago • 28.3k • 22 nvidia/Llama-3.3-70B-Instruct-FP8 71B • Updated 20 days ago • 60.2k • 7
Nemotron-H Mamba-Transformer hybrid models nvidia/Nemotron-H-47B-Reasoning-128K Text Generation • 47B • Updated Jul 11 • 747 • 18 nvidia/Nemotron-H-8B-Reasoning-128K Text Generation • 8B • Updated Jul 11 • 10.3k • 22 nvidia/Nemotron-H-8B-Reasoning-128K-FP8 Text Generation • 8B • Updated 21 days ago • 201 • 12 nvidia/Nemotron-H-47B-Reasoning-128K-FP8 Text Generation • 47B • Updated 21 days ago • 106 • 5
Cosmos-Embed1 Joint video-text embedding for physical AI nvidia/Cosmos-Embed1-224p 1B • Updated Jun 10 • 17.9k • 4 nvidia/Cosmos-Embed1-336p 1B • Updated Jun 10 • 1.03k nvidia/Cosmos-Embed1-448p 1B • Updated Jun 10 • 1.32k • 2 Build error Cosmos Embed1 🚀 Cosmos-Embed1 demo app
Describe Anything Multimodal Large Language Models for Detailed Localized Image and Video Captioning Runtime error 336 336 Describe Anything ⚡ Describe masked parts of images using prompts nvidia/DAM-3B Image-Text-to-Text • Updated May 7 • 6.47k • 127 nvidia/DAM-3B-Video Image-Text-to-Text • Updated May 7 • 933 • 56 nvidia/DAM-3B-Self-Contained Image-Text-to-Text • Updated May 7 • 6.17k • 23
AceMath-RL Math reasoning models trained through reinforcement learning (RL) nvidia/AceMath-RL-Nemotron-7B Text Generation • 8B • Updated Apr 23 • 2.75k • • 23
OpenMathReasoning Models and datasets from "AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset" nvidia/OpenMathReasoning Viewer • Updated May 27 • 5.68M • 7.84k • 335 nvidia/OpenMath-Nemotron-1.5B Text Generation • 2B • Updated Apr 30 • 3.56k • • 24 nvidia/OpenMath-Nemotron-7B Text Generation • 8B • Updated Apr 30 • 1.55k • • 9 nvidia/OpenMath-Nemotron-14B Text Generation • 15B • Updated Apr 30 • 1.79k • 12
OpenCodeReasoning Reasoning data for supervised finetuning of LLMs to advance data distillation for competitive coding nvidia/OpenCodeReasoning Viewer • Updated May 4 • 753k • 2.74k • 492 OpenCodeReasoning: Advancing Data Distillation for Competitive Coding Paper • 2504.01943 • Published Apr 2 • 16 nvidia/OpenCodeReasoning-Nemotron-7B Text Generation • 8B • Updated May 7 • 1.26k • • 37 nvidia/OpenCodeReasoning-Nemotron-14B Text Generation • 15B • Updated May 7 • 547 • 18
OpenCodeReasoning: Advancing Data Distillation for Competitive Coding Paper • 2504.01943 • Published Apr 2 • 16
OpenCodeReasoning-II Reasoning data for supervised finetuning of LLMs to advance code generation and critique nvidia/OpenCodeReasoning-2 Viewer • Updated May 17 • 2.16M • 2.04k • 39 nvidia/OpenCodeReasoning Viewer • Updated May 4 • 753k • 2.74k • 492
Llama Nemotron Feedback-Edit Inference-Time Scaling Novel ITS approach for open-ended tasks - No. 1 on Arena Hard on 18 Mar 2025 nvidia/Llama-3.3-Nemotron-70B-Feedback Text Generation • 71B • Updated Mar 18 • 83 • 7 nvidia/Llama-3.3-Nemotron-70B-Edit Text Generation • 71B • Updated Mar 18 • 104 • 3 nvidia/Llama-3.3-Nemotron-70B-Select Text Generation • 71B • Updated Mar 18 • 1.24k • 10 nvidia/HelpSteer3 Viewer • Updated Jul 2 • 99k • 3.09k • 78
Scoring Verifiers Benchmarks for evaluating synthetic verifiers like test case generation and code reward models (as found in https://www.arxiv.org/abs/2502.13820). nvidia/Scoring-Verifiers Updated Apr 1 • 56 • 7 Scoring Verifiers: Evaluating Synthetic Verification in Code and Reasoning Paper • 2502.13820 • Published Feb 19
Scoring Verifiers: Evaluating Synthetic Verification in Code and Reasoning Paper • 2502.13820 • Published Feb 19
Nemotron-UltraLong nvidia/Llama-3.1-Nemotron-8B-UltraLong-1M-Instruct Text Generation • 8B • Updated Apr 17 • 6.12k • 51 nvidia/Llama-3.1-Nemotron-8B-UltraLong-4M-Instruct Text Generation • 8B • Updated Apr 17 • 5.46k • 119 nvidia/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct Text Generation • 8B • Updated Apr 17 • 1.18k • 15
nvidia/Llama-3.1-Nemotron-8B-UltraLong-1M-Instruct Text Generation • 8B • Updated Apr 17 • 6.12k • 51
nvidia/Llama-3.1-Nemotron-8B-UltraLong-4M-Instruct Text Generation • 8B • Updated Apr 17 • 5.46k • 119
nvidia/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct Text Generation • 8B • Updated Apr 17 • 1.18k • 15
Cosmos-Reason1 Multimodal world understanding through reasoning nvidia/Cosmos-Reason1-7B Image-Text-to-Text • 8B • Updated 28 days ago • 355k • 168 nvidia/Cosmos-Reason1-RL-Dataset Viewer • Updated May 20 • 892 • 417 • 11 nvidia/Cosmos-Reason1-Benchmark Viewer • Updated May 20 • 510 • 630 • 6 nvidia/Cosmos-Reason1-SFT-Dataset Viewer • Updated May 20 • 1.71M • 1.6k • 19
Cosmos-Transfer1 Multimodal Conditional World Generation for World2World Transfer nvidia/Cosmos-Transfer1-7B Updated Jul 11 • 3.27k • 49 nvidia/Cosmos-Transfer1-7B-Sample-AV Updated Apr 9 • 1.52k • 14 nvidia/Cosmos-Transfer1-7B-Sample-AV-Data-Example Viewer • Updated Mar 19 • 130 • 71 • 6 nvidia/Cosmos-Transfer1-7B-4KUpscaler Updated Mar 20 • 56 • 6
Cosmos-Tokenize1 A suite of image and video tokenizers nvidia/Cosmos-Tokenize1-CI8x8-360p Updated Mar 18 • 93 • 2 nvidia/Cosmos-Tokenize1-CI16x16-360p Updated Mar 18 • 84 • 1 nvidia/Cosmos-Tokenize1-CV4x8x8-360p Updated Mar 18 • 88 • 1 nvidia/Cosmos-Tokenize1-CV8x8x8-720p Updated Apr 23 • 3.27k • 3
Cosmos-Predict1 World Foundation Model for Future Prediction nvidia/Cosmos-Predict1-4B Updated Apr 8 • 95 • 2 nvidia/Cosmos-Predict1-5B-Video2World Updated Apr 8 • 82 • 3 nvidia/Cosmos-Predict1-7B-Text2World Updated Apr 8 • 186 • 4 nvidia/Cosmos-Predict1-7B-Video2World Updated Apr 8 • 127 • 2
Cosmos-Tokenizer A suite of image and video tokenizers nvidia/Cosmos-0.1-Tokenizer-CI8x8 Updated Nov 11, 2024 • 127 • 32 nvidia/Cosmos-0.1-Tokenizer-CI16x16 Updated Dec 25, 2024 • 107 • 8 nvidia/Cosmos-0.1-Tokenizer-DI8x8 Updated Dec 25, 2024 • 94 • 11 nvidia/Cosmos-0.1-Tokenizer-DI16x16 Updated Dec 25, 2024 • 84 • 9
Llama-3.1-Nemotron-70B SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. nvidia/Llama-3.1-Nemotron-70B-Instruct-HF Text Generation • 71B • Updated Apr 13 • 90.1k • 2.05k nvidia/Llama-3.1-Nemotron-70B-Reward-HF 71B • Updated Apr 13 • 8.18k • 88 nvidia/HelpSteer2 Viewer • Updated Dec 18, 2024 • 21.4k • 10.8k • 428 HelpSteer2-Preference: Complementing Ratings with Preferences Paper • 2410.01257 • Published Oct 2, 2024 • 25
Physical AI Collection of commercial-grade datasets for physical AI developers nvidia/PhysicalAI-SmartSpaces Updated 6 days ago • 37.1k • 44 nvidia/PhysicalAI-Robotics-Manipulation-Kitchen Viewer • Updated May 15 • 405k • 2.47k • 10 nvidia/PhysicalAI-Robotics-GraspGen Viewer • Updated Jun 21 • 25.5k • 672 • 22 nvidia/PhysicalAI-Robotics-Manipulation-SingleArm Updated May 15 • 14.2k • 12
QLIP QLIP is a family of image tokenizers with SOTA reconstruction quality and zero-shot image understanding. nvidia/QLIP-L-14-392 0.7B • Updated Feb 10 • 173 • 11 nvidia/QLIP-B-8-256 0.2B • Updated Feb 10 • 341 • 7 nvidia/QLIP-B-16-256 0.2B • Updated Feb 10 • 117 • 4
Cosmos The collection of Cosmos models nvidia/Cosmos-1.0-Guardrail Updated Jun 11 • 1.29k • 56 nvidia/Cosmos-1.0-Autoregressive-4B Updated Feb 11 • 37 • 54
DMC LLMs equipped with Dynamic Memory Compression to accelerate generation. nvidia/Llama-2-7B-DMC-4x Updated Dec 22, 2024 • 1 nvidia/Llama-2-7B-DMC-8x Updated Dec 22, 2024 • 2 nvidia/Llama-2-13B-DMC-4x Updated Dec 22, 2024 • 1 nvidia/Llama-2-13B-DMC-8x Updated Dec 22, 2024 • 2
AceMath We are releasing math instruction models, math reward models, general instruction models, all training datasets, and a math reward benchmark. nvidia/AceMath-1.5B-Instruct Text Generation • 2B • Updated Jan 17 • 3.58k • 13 nvidia/AceMath-7B-Instruct Text Generation • 8B • Updated Jan 17 • 1.67k • 26 nvidia/AceMath-72B-Instruct Text Generation • 73B • Updated Jan 17 • 1.84k • 19 nvidia/AceMath-7B-RM Text Generation • 7B • Updated Jan 17 • 8.68k • 6
NemoGuard Essential datasets and models for content safety, topic-following, and security guardrails nvidia/Aegis-AI-Content-Safety-Dataset-2.0 Viewer • Updated Jun 9 • 33.4k • 3.24k • 47 nvidia/llama-3.1-nemoguard-8b-topic-control Text Classification • Updated Jun 9 • 2.28k • 16 nvidia/llama-3.1-nemoguard-8b-content-safety Text Classification • Updated Jun 9 • 446 • 25 nvidia/CantTalkAboutThis-Topic-Control-Dataset Viewer • Updated Jan 16 • 1.09k • 100 • 7
Eagle 2 Eagle 2 is a family of frontier vision-language models with vision-centric design. The model supports 4K HD input, long-context video, and grounding. nvidia/Eagle2-1B Image-Text-to-Text • 1B • Updated Apr 27 • 4.22k • 25 nvidia/Eagle2-2B Image-Text-to-Text • 2B • Updated Apr 27 • 1.31k • 29 nvidia/Eagle2-9B Image-Text-to-Text • 9B • Updated Jan 28 • 379 • 61
NeMo Audio Codecs A series of Neural Audio Codecs nvidia/low-frame-rate-speech-codec-22khz Feature Extraction • Updated Aug 5 • 195 • 17 nvidia/audio-codec-22khz Feature Extraction • Updated Aug 5 • 90 • 5 nvidia/audio-codec-44khz Feature Extraction • Updated Aug 5 • 438 • 21 nvidia/mel-codec-22khz Feature Extraction • Updated Aug 5 • 77 • 3
Hymba A series of Hybrid Small Language Models. nvidia/Hymba-1.5B-Instruct Text Generation • 2B • Updated Jan 2 • 436 • 235 nvidia/Hymba-1.5B-Base Text Generation • 2B • Updated Jan 2 • 493 • 149 Hymba: A Hybrid-head Architecture for Small Language Models Paper • 2411.13676 • Published Nov 20, 2024 • 46
Optimized ONNX models for NVIDIA RTX GPUs Collection of optimized ONNX model checkpoints for NVIDIA RTX GPUs nvidia/Gemma-2b-it-ONNX-INT4 Updated Nov 15, 2024 • 8 nvidia/Meta-Llama-3.1-8B-Instruct-ONNX-INT4 Updated Nov 15, 2024 • 92 • 5 nvidia/Meta-Llama-3.2-3B-Instruct-ONNX-INT4 Updated Nov 15, 2024 • 8 nvidia/Mistral-7B-Instruct-v0.3-ONNX-INT4 Updated Nov 15, 2024 • 6
NVLM 1.0 A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks. nvidia/NVLM-D-72B Image-Text-to-Text • 79B • Updated Jan 14 • 73.5k • 773 nvidia/NVLM-D-72B-mcore Image-Text-to-Text • Updated Jan 14 • 6
OpenMath-2 A collection of models and datasets introduced in "OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data" nvidia/OpenMath2-Llama3.1-8B Text Generation • 8B • Updated Nov 25, 2024 • 6.14k • 31 nvidia/OpenMath2-Llama3.1-70B Text Generation • 71B • Updated Nov 25, 2024 • 1.4k • 20 nvidia/OpenMathInstruct-2 Viewer • Updated Nov 25, 2024 • 22M • 13.7k • 199 nvidia/OpenMath2-Llama3.1-8B-nemo Updated Nov 25, 2024 • 6
Nemotron 4 340B Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. nvidia/Nemotron-4-340B-Instruct Updated Jun 24, 2024 • 40 • 684 nvidia/Nemotron-4-340B-Reward Updated Jun 19, 2024 • 15 • 125 nvidia/Nemotron-4-340B-Base Updated Jun 28, 2024 • 24 • 146 nvidia/HelpSteer2 Viewer • Updated Dec 18, 2024 • 21.4k • 10.8k • 428
SteerLM A collection of models and datasets relating to SteerLM and HelpSteer. nvidia/HelpSteer2 Viewer • Updated Dec 18, 2024 • 21.4k • 10.8k • 428 nvidia/Llama3-70B-SteerLM-RM Updated Jun 19, 2024 • 72 • 43 nvidia/Nemotron-4-340B-Reward Updated Jun 19, 2024 • 15 • 125 nvidia/HelpSteer Viewer • Updated Dec 18, 2024 • 37.1k • 2.39k • 241
Parakeet NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. nvidia/parakeet-rnnt-1.1b Automatic Speech Recognition • Updated Feb 18 • 9.27k • 158 nvidia/parakeet-ctc-1.1b Automatic Speech Recognition • 1B • Updated Jul 29 • 2.87k • 33 nvidia/parakeet-rnnt-0.6b Automatic Speech Recognition • Updated Feb 18 • 175k • 10 nvidia/parakeet-ctc-0.6b Automatic Speech Recognition • Updated Aug 22, 2024 • 3.05k • 16
Canary A collection of multilingual and multitask speech to text models from NVIDIA NeMo 🐤 nvidia/canary-1b Automatic Speech Recognition • Updated Apr 24 • 2.05k • 445 nvidia/canary-1b-flash Automatic Speech Recognition • 0.8B • Updated 22 days ago • 9.69k • 246 nvidia/canary-180m-flash Automatic Speech Recognition • Updated Mar 18 • 2.3k • 73 Training and Inference Efficiency of Encoder-Decoder Speech Models Paper • 2503.05931 • Published Mar 7 • 3
InstructRetro InstructRetro is an autoregressive decoder-only language model (LM) with retrieval-augmented pretraining and instruction tuning. nvidia/retro-48b-instruct-4k Text Generation • Updated May 29, 2024 • 20 nvidia/retro-8b-instruct-4k Text Generation • Updated May 29, 2024 • 14
OpenMath A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset" nvidia/OpenMath-Mistral-7B-v0.1 Updated Feb 16, 2024 • 12 nvidia/OpenMath-Mistral-7B-v0.1-hf Text Generation • 7B • Updated Feb 16, 2024 • 129 • 33 nvidia/OpenMath-CodeLlama-7b-Python Updated Feb 16, 2024 • 2 • 2 nvidia/OpenMath-CodeLlama-7b-Python-hf Text Generation • 7B • Updated Feb 16, 2024 • 75 • 7
RLHF A collection of models trained with Reinforcement Learning from Human Feedback (RLHF). nvidia/NV-Llama2-70B-RLHF-Chat Text Generation • Updated Mar 9, 2024 • 4 nvidia/NV-Llama2-13B-RLHF-RM Text Generation • Updated Mar 9, 2024 • 36 • 3 nvidia/sft_datablend_v1 Viewer • Updated Mar 9, 2024 • 128k • 50 • 14 nvidia/Daring-Anteater Viewer • Updated Jun 17, 2024 • 99.5k • 1.01k • 26
NV-Embed NV-Embed is a generalist embedding model encompassing retrieval, reranking, classification, clustering, and STS tasks. nvidia/NV-Embed-v1 8B • Updated Nov 30, 2024 • 4.1k • 426 nvidia/NV-Embed-v2 Feature Extraction • 8B • Updated Jul 21 • 112k • 461 nvidia/MM-Embed 8B • Updated Nov 6, 2024 • 2.39k • 60
Llama3-ChatQA-1.5 Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). nvidia/Llama3-ChatQA-1.5-8B Text Generation • 8B • Updated May 24, 2024 • 11.6k • 554 nvidia/Llama3-ChatQA-1.5-70B Text Generation • 71B • Updated May 24, 2024 • 151 • 333 nvidia/ChatRAG-Bench Viewer • Updated May 24, 2024 • 34.6k • 1.05k • 113 nvidia/ChatQA-Training-Data Viewer • Updated Jun 4, 2024 • 442k • 614 • 172
SSMs A collection of Mamba-2-based research models with 8B parameters trained on 3.5T tokens for comparison with Transformers. nvidia/mamba2-8b-3t-4k Text Generation • Updated Jun 13, 2024 • 19 nvidia/mamba2-hybrid-8b-3t-128k Text Generation • Updated Jun 13, 2024 • 44 nvidia/mamba2-hybrid-8b-3t-32k Text Generation • Updated Jun 13, 2024 • 5 nvidia/mamba2-hybrid-8b-3t-4k Text Generation • Updated Jun 13, 2024 • 73
Nemotron 3 8B The Nemotron 3 8B Family of models is optimized for building production-ready generative AI applications for the enterprise. nvidia/nemotron-3-8b-base-4k Text Generation • Updated Feb 9, 2024 • 4 • 91 nvidia/nemotron-3-8b-chat-4k-sft Text Generation • Updated Feb 9, 2024 • 10 nvidia/nemotron-3-8b-chat-4k-rlhf Text Generation • Updated Feb 9, 2024 • 6 • 27 nvidia/nemotron-3-8b-chat-4k-steerlm Text Generation • Updated Feb 9, 2024 • 2 • 22
BigVGAN BigVGAN is a universal neural vocoder that generates audio waveform using mel spectrogram as input. Running 106 BigVGAN 🔊 Generate high-quality audio from input audio nvidia/bigvgan_v2_44khz_128band_512x Audio-to-Audio • Updated Sep 5, 2024 • 683k • 54 nvidia/bigvgan_v2_44khz_128band_256x Audio-to-Audio • Updated Sep 5, 2024 • 677 • 7 nvidia/bigvgan_v2_24khz_100band_256x Audio-to-Audio • Updated Sep 5, 2024 • 16k • 17
MambaVision MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes both 1K and 21K pretrained models. nvidia/MambaVision-L3-512-21K Image Classification • 0.7B • Updated Mar 29 • 292 • 52 nvidia/MambaVision-L3-256-21K Image Classification • 0.7B • Updated Mar 29 • 67 • 7 nvidia/MambaVision-L2-512-21K Image Classification • 0.2B • Updated Mar 29 • 166 • 3 nvidia/MambaVision-L-21K Image Classification • 0.2B • Updated Mar 29 • 107 • 4
PS3: Scaling Vision Pre-Training to 4K Resolution Enabling 4k resolution for VLMs, CVPR 2025, https://nvlabs.github.io/PS3/ nvidia/PS3-1.5K-SigLIP2 Image Feature Extraction • 1B • Updated Jul 30 • 193 • 1 nvidia/PS3-4K-SigLIP2 Image Feature Extraction • 1B • Updated Jul 30 • 159 • 1 nvidia/PS3_Lang-1.5K-SigLIP2 Image Feature Extraction • 0.5B • Updated Jul 30 • 107 • 1 nvidia/PS3_Lang-4K-SigLIP2 Image Feature Extraction • 0.6B • Updated Jul 30 • 36
Minitron A family of compressed models obtained via pruning and knowledge distillation nvidia/Mistral-NeMo-Minitron-8B-Base Text Generation • 8B • Updated Aug 22, 2024 • 4.32k • 177 nvidia/Mistral-NeMo-Minitron-8B-Instruct Text Generation • 8B • Updated Oct 9, 2024 • 2.07k • 80 nvidia/Llama-3_1-Nemotron-51B-Instruct Text Generation • 52B • Updated Jul 6 • 9.52k • 210 nvidia/Llama-3.1-Minitron-4B-Width-Base Text Generation • 5B • Updated Feb 14 • 4.43k • 191
RADIO A collection of Foundation Vision Models that combine multiple models (CLIP, DINOv2, SAM, etc.). nvidia/C-RADIOv3-B 0.1B • Updated Jun 25 • 37.2k • 3 nvidia/C-RADIOv3-L 0.3B • Updated Jul 30 • 3.61k • 1 nvidia/C-RADIOv3-H 0.7B • Updated Jun 25 • 179 • 1 nvidia/C-RADIOv3-g 1B • Updated Jul 30 • 999 • 6
Llama3-ChatQA-2 ChatQA-2 is a suite of 128K long-context models that also have exceptional RAG capabilities. nvidia/Llama3-ChatQA-2-70B Text Generation • Updated Sep 10, 2024 • 131 • 12 nvidia/Llama3-ChatQA-2-8B Text Generation • Updated Sep 10, 2024 • 796 • 16 nvidia/ChatQA2-Long-SFT-data Viewer • Updated Sep 9, 2024 • 117k • 293 • 31
NeMo Curator - Classifier Models Classifier models that can be used in NeMo Curator for labelling/filtering datasets. nvidia/domain-classifier Updated Jan 24 • 10.9k • 88 nvidia/quality-classifier-deberta Updated Jan 31 • 1.27k • 66 HuggingFaceFW/fineweb-edu-classifier Text Classification • 0.1B • Updated Nov 17, 2024 • 15.4k • 193 nvidia/Aegis-AI-Content-Safety-LlamaGuard-Defensive-1.0 Text Classification • Updated Jun 9 • 14k • 25