Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
moonshotai
/
Moonlight-16B-A3B-Instruct
like
153
Follow
Moonshot AI
553
Text Generation
Transformers
Safetensors
deepseek_v3
conversational
custom_code
arxiv:
2502.16982
License:
mit
Model card
Files
Files and versions
Community
14
Train
Deploy
Use this model
hotfix cache_utils max_cache_length
#10
by
hewr2010
- opened
Mar 3
base:
refs/heads/main
β
from:
refs/pr/10
Discussion
Files changed
+1
-1
hewr2010
Moonshot AI org
Mar 3
No description provided.
π
1
1
+
fix: update max_cache_length api
cc3d2856
hewr2010
changed pull request status to
open
Mar 3
hewr2010
changed pull request status to
merged
Mar 3
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Your need to confirm your account before you can post a new comment.
Comment
Β·
Sign up
or
log in
to comment