Shirong Ma
msr2000
AI & ML interests
None yet
Recent Activity
updated
a model
13 days ago
deepseek-ai/DeepSeek-R1-Zero
updated
a model
13 days ago
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
updated
a model
13 days ago
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
Organizations
msr2000's activity
qwen32B蒸馏模型,长度>8k时,预测一定比例乱码,出现<think><think><think><think><think><think>
5
#44 opened 28 days ago
by
daniellibin
Upload 1735146950945.jpg
3
#11 opened about 2 months ago
by
NZEEMSZY

Create Dondasse
#15 opened about 2 months ago
by
Dondasse
Water and forests
2
#16 opened about 2 months ago
by
Dondasse
Update README.md with vLLM Support
1
#8 opened about 2 months ago
by
simon-mo
Update README.md with vLLM Support
#28 opened about 2 months ago
by
simon-mo
fail to run the example
8
#4 opened 10 months ago
by
Leymore

keyError: 'sdpa'
1
#3 opened 10 months ago
by
fengzi258
vllm support
7
#2 opened 10 months ago
by
Sihangli

KV Cache for compress_kv or key-value states
6
#1 opened 10 months ago
by
House-99