Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
nvidia 's Collections
Llama Nemotron
Describe Anything
AceMath-RL
OpenCodeReasoning-2
OpenMathReasoning
Nemotron-H
OpenCodeReasoning
Llama Nemotron Feedback-Edit Inference-Time Scaling
Scoring Verifiers
Nemotron-UltraLong
Cosmos Transfer1
Cosmos Tokenize1
Cosmos Predict1
Llama-3.1-Nemotron-70B
Physical AI
NVILA-Speech-Audio-Setups
QLIP
Cosmos
DMC
AceMath
NemoGuard
Cosmos Tokenizer
Eagle 2
NeMo Audio Codecs
Hymba
Optimized ONNX models for NVIDIA RTX GPUs
Riva
NVLM 1.0
OpenMath-2
Nemotron 4 340B
SteerLM
Parakeet
Canary
InstructRetro
OpenMath
RLHF
NV-Embed
Llama3-ChatQA-1.5
SSMs
Nemotron 3 8B
BigVGAN
MambaVision
Minitron
RADIO
Model Optimizer
Llama3-ChatQA-2
NeMo Curator - Classifier Models

Optimized ONNX models for NVIDIA RTX GPUs

updated 4 days ago

Collection of optimized ONNX model checkpoints for NVIDIA RTX GPUs

Upvote
10

  • nvidia/Gemma-2b-it-ONNX-INT4

    Updated Nov 15, 2024 • 6

  • nvidia/Meta-Llama-3.1-8B-Instruct-ONNX-INT4

    Updated Nov 15, 2024 • 53 • 5

  • nvidia/Meta-Llama-3.2-3B-Instruct-ONNX-INT4

    Updated Nov 15, 2024 • 7

  • nvidia/Mistral-7B-Instruct-v0.3-ONNX-INT4

    Updated Nov 15, 2024 • 2

  • nvidia/Phi-3.5-mini-Instruct-ONNX-INT4

    Updated Nov 15, 2024 • 1

  • nvidia/Mistral-Nemo-12B-Instruct-ONNX-INT4

    Updated Nov 15, 2024 • 2

  • nvidia/Nemotron-Mini-4B-Instruct-ONNX-INT4

    Updated Nov 18, 2024 • 5
Upvote
10
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs