Model Optimizer
Updated 4 days ago
A collection of generative models quantized and optimized with TensorRT Model Optimizer.
nvidia/Llama-3.1-8B-Instruct-FP8 • Text Generation • Updated Mar 26 • 9.03k downloads • 23 likes
nvidia/Llama-3.1-405B-Instruct-FP8 • Text Generation • Updated Feb 26 • 14.2k downloads • 10 likes
nvidia/Llama-3.1-70B-Instruct-FP8 • Text Generation • Updated Feb 26 • 7.29k downloads • 12 likes
nvidia/Llama-3.1-8B-Medusa-FP8 • Updated Jan 31 • 167 downloads • 6 likes
nvidia/Llama-3.3-70B-Instruct-FP4 • Updated Feb 26 • 3.84k downloads • 19 likes
nvidia/Llama-3.1-405B-Instruct-FP4 • Updated Feb 26 • 3.37k downloads • 5 likes
nvidia/DeepSeek-R1-FP4 • Text Generation • Updated Apr 3 • 30.3k downloads • 239 likes