Kadins/DeepSeek-R1-Distill-Qwen-7B-GRPO-v7
Tags: Text Generation, Transformers, Safetensors, qwen2, Generated from Trainer, trl, grpo, conversational, text-generation-inference, arxiv:2402.03300
Branch: main
Commit History
Model save (16e7ac5, verified): Kadins committed on Mar 15
Training in progress, step 385 (a5142a4, verified): Kadins committed on Mar 15
Training in progress, step 350 (d9cf1d8, verified): Kadins committed on Mar 15
Training in progress, step 300 (5104750, verified): Kadins committed on Mar 15
Training in progress, step 250 (de1b7af, verified): Kadins committed on Mar 15
Training in progress, step 200 (12f732b, verified): Kadins committed on Mar 15
Training in progress, step 150 (e74cc91, verified): Kadins committed on Mar 15
Training in progress, step 100 (8c32d82, verified): Kadins committed on Mar 15
Training in progress, step 50 (4c52135, verified): Kadins committed on Mar 15
initial commit (ddc1ada, verified): Kadins committed on Mar 14