DeepSeek-R1-Distill-Qwen-7B-GRPO / train_results.json

Commit History

Model save
a62814b
verified

Kadins commited on