Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
nvidia 's Collections
Nemotron-H
Llama Nemotron
GEN3C
Model Optimizer
AceReason
Describe Anything
AceMath-RL
OpenMathReasoning
OpenCodeReasoning
OpenCodeReasoning-II
Llama Nemotron Feedback-Edit Inference-Time Scaling
Scoring Verifiers
Nemotron-UltraLong
Cosmos-Reason1
Cosmos-Transfer1
Cosmos-Tokenize1
Cosmos-Predict1
Cosmos-Tokenizer
Llama-3.1-Nemotron-70B
Physical AI
NVILA-Speech-Audio-Setups
QLIP
Cosmos
DMC
AceMath
NemoGuard
Eagle 2
NeMo Audio Codecs
Hymba
Optimized ONNX models for NVIDIA RTX GPUs
Riva
NVLM 1.0
OpenMath-2
Nemotron 4 340B
SteerLM
Parakeet
Canary
InstructRetro
OpenMath
RLHF
NV-Embed
Llama3-ChatQA-1.5
SSMs
Nemotron 3 8B
BigVGAN
MambaVision
PS3: Scaling Vision Pre-Training to 4K Resolution
Minitron
RADIO
Llama3-ChatQA-2
NeMo Curator - Classifier Models

Canary

updated 2 days ago

A collection of multilingual and multitask speech to text models from NVIDIA NeMo 🐤

Upvote
21

  • nvidia/canary-1b

    Automatic Speech Recognition • Updated Apr 24 • 15.8k • 428

  • nvidia/canary-1b-flash

    Automatic Speech Recognition • Updated Mar 18 • 324k • 206

  • nvidia/canary-180m-flash

    Automatic Speech Recognition • Updated Mar 18 • 7.71k • 66

  • Training and Inference Efficiency of Encoder-Decoder Speech Models

    Paper • 2503.05931 • Published Mar 7 • 3
Upvote
21
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs