CobraMamba
13 followers · 1 following
https://github.com/chi2liu
633WHU
AI & ML interests
None yet
Recent Activity
published a model 2 months ago
CobraMamba/DeepSeek-R1-Distill-Qwen-1.5B-GSPO3
updated a model 2 months ago
CobraMamba/DeepSeek-R1-Distill-Qwen-1.5B-GSPO2
published a model 2 months ago
CobraMamba/DeepSeek-R1-Distill-Qwen-1.5B-GSPO2
Organizations
None yet
CobraMamba's activity
published a model 2 months ago
CobraMamba/DeepSeek-R1-Distill-Qwen-1.5B-GSPO3
Updated Aug 11
updated a model 2 months ago
CobraMamba/DeepSeek-R1-Distill-Qwen-1.5B-GSPO2
Text Generation • 2B • Updated Aug 10 • 11
published 2 models 2 months ago
CobraMamba/DeepSeek-R1-Distill-Qwen-1.5B-GSPO2
Text Generation • 2B • Updated Aug 10 • 11
CobraMamba/DeepSeek-R1-Distill-Qwen-1.5B-GSPO
Updated Aug 10
updated a model 5 months ago
CobraMamba/Qwen3-30B-A3B-AWQ-4Bit
Text Generation • 5B • Updated May 9 • 36
updated a collection 5 months ago
Qwen-AWQ
Collection • 4 items • Updated May 9
published a model 5 months ago
CobraMamba/Qwen3-30B-A3B-AWQ-4Bit
Text Generation • 5B • Updated May 9 • 36
New activity in ISTA-DASLab/DeepSeek-R1-GPTQ-4b-128g-experts 5 months ago
How to Only compress non-shared experts within transformer blocks?
1
#1 opened 5 months ago by CobraMamba
liked a model 5 months ago
ISTA-DASLab/DeepSeek-R1-GPTQ-4b-128g-experts
Text Generation • Updated Apr 8 • 8 • 4
updated a collection 5 months ago
Qwen-AWQ
Collection • 4 items • Updated May 9
updated a model 5 months ago
CobraMamba/Qwen3-32B-AWQ
Text Generation • 6B • Updated Apr 30 • 56
published a model 5 months ago
CobraMamba/Qwen3-32B-AWQ
Text Generation • 6B • Updated Apr 30 • 56
updated a collection 5 months ago
Qwen-AWQ
Collection • 4 items • Updated May 9
updated a model 5 months ago
CobraMamba/Qwen3-8B-AWQ
Text Generation • 2B • Updated Apr 30 • 3
published a model 5 months ago
CobraMamba/Qwen3-8B-AWQ
Text Generation • 2B • Updated Apr 30 • 3
New activity in CobraMamba/mamba-gpt-7b 5 months ago
Adding `safetensors` variant of this model
#2 opened 11 months ago by SFconvertbot
New activity in CobraMamba/mamba-gpt-7b-v2 5 months ago
Adding `safetensors` variant of this model
#2 opened 11 months ago by SFconvertbot
New activity in CobraMamba/mamba-gpt-7b-v1 5 months ago
Base Model
1
#2 opened 11 months ago by Shameless111
Adding `safetensors` variant of this model
#3 opened 9 months ago by SFconvertbot
updated a collection 5 months ago
Qwen-AWQ
Collection • 4 items • Updated May 9