nvidia/Llama-3_1-Nemotron-51B-Instruct

Text Generation
Transformers
Safetensors
PyTorch
English
nemotron-nas
nvidia
llama-3
conversational
custom_code
Can Llama-3.1-Nemotron-40B-Instruct be released as well?

1
#24 opened 4 months ago by tdh111

What is the context size this model was trained on?

2
#23 opened 4 months ago by treehugg3

Modified llama.cpp to generate GGUFs for Llama-3_1-Nemotron-51

2
#22 opened 5 months ago by ymcki

Documentation about the linear attention used in some layers of this model?

#21 opened 5 months ago by ymcki

Comparison to the 70B model?

1
1
#20 opened 6 months ago by AIGUYCONTENT

Update README.md

#11 opened 7 months ago by Vlad748283847

vLLM compatible?

5
3
#10 opened 7 months ago by nickandbro

AttributeError: 'DeciLMConfig'

3
#9 opened 7 months ago by bluenevus

fp8 / int8 inference - use bitsandbytes or awq

2
#8 opened 8 months ago by dtanow

GGUF possible?

4
2
#5 opened 8 months ago by gopi87

fine-tuning

#1 opened 8 months ago by kzmaker