Kadins/DeepSeek-R1-Distill-Qwen-7B-GRPO-v7-2
Text Generation
Transformers
Safetensors
qwen2
Generated from Trainer
trl
grpo
conversational
text-generation-inference
arxiv:2402.03300
Commit History (branch: main)
Model save (e9cf7ae, verified), Kadins committed on Mar 18
Training in progress, step 385 (fa522d9, verified), Kadins committed on Mar 18
Training in progress, step 350 (7d13722, verified), Kadins committed on Mar 18
Training in progress, step 250 (13e107e, verified), Kadins committed on Mar 18
Training in progress, step 200 (8be7cdb, verified), Kadins committed on Mar 18
Training in progress, step 150 (c4b3ec3, verified), Kadins committed on Mar 17
Training in progress, step 100 (dc8e9d3, verified), Kadins committed on Mar 17
Training in progress, step 50 (a4ecd55, verified), Kadins committed on Mar 17
initial commit (b0aecde, verified), Kadins committed on Mar 17