SGLang/DeepSeek-V3-NextN
Transformers
Safetensors
deepseek_v3
custom_code
Inference Endpoints
fp8
arxiv:2412.19437
Add text-generation pipeline tag and MIT license
#2 opened 3 days ago by nielsr
Is this MTP head just for predicting one token ahead?
#1 opened 5 days ago by RonanMcGovern