Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
dxtustc
/
DeepSeek-R1-Distill-Qwen-1.5B-GRPO
like
0
Safetensors
qwen2
Model card
Files
Files and versions
Community
main
DeepSeek-R1-Distill-Qwen-1.5B-GRPO
/
model.safetensors
Commit History
Training in progress, step 1000
2b9fec8
verified
dxtustc
commited on
Mar 8
Training in progress, step 950
68333b9
verified
dxtustc
commited on
Mar 7
Training in progress, step 900
d64d0f9
verified
dxtustc
commited on
Mar 7
Training in progress, step 850
3398b27
verified
dxtustc
commited on
Mar 7
Training in progress, step 800
c7a13b0
verified
dxtustc
commited on
Mar 7
Training in progress, step 750
2215fde
verified
dxtustc
commited on
Mar 7
Training in progress, step 700
e30f54c
verified
dxtustc
commited on
Mar 7
Training in progress, step 650
abdc760
verified
dxtustc
commited on
Mar 7
Training in progress, step 600
5d84fe8
verified
dxtustc
commited on
Mar 7
Training in progress, step 550
afa843c
verified
dxtustc
commited on
Mar 7
Training in progress, step 500
6db2a37
verified
dxtustc
commited on
Mar 7
Training in progress, step 450
42792de
verified
dxtustc
commited on
Mar 7
Training in progress, step 400
8aa488a
verified
dxtustc
commited on
Mar 7
Training in progress, step 350
adb3cbf
verified
dxtustc
commited on
Mar 7
Training in progress, step 300
b5a1f2c
verified
dxtustc
commited on
Mar 7
Training in progress, step 250
6ec27fc
verified
dxtustc
commited on
Mar 7
Training in progress, step 200
217475b
verified
dxtustc
commited on
Mar 7
Training in progress, step 150
f22a583
verified
dxtustc
commited on
Mar 7
Training in progress, step 100
f9e203e
verified
dxtustc
commited on
Mar 7
Training in progress, step 50
6047e80
verified
dxtustc
commited on
Mar 7