DeepSeek-R1-Distill-Qwen-7B-GRPO-v8 / model.safetensors.index.json

Commit History

Training in progress, step 50
b5a5bb7
verified

Kadins commited on