Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
deepseek-ai
/
DeepSeek-R1
like
12.7k
Follow
DeepSeek
95.6k
Text Generation
Transformers
Safetensors
deepseek_v3
conversational
custom_code
text-generation-inference
fp8
arxiv:
2501.12948
License:
mit
Model card
Files
Files and versions
xet
Community
224
Train
Deploy
Use this model
Set Max Model Length to correct value
#233
by
chandra-reddy
- opened
8 days ago
base:
refs/heads/main
←
from:
refs/pr/233
Discussion
Files changed
+0
-0
This PR is in
draft mode
Files changed (0)
hide
show