alphadl
/

R1-Distill-1.5B-Qwen-GRPO

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions Community

R1-Distill-1.5B-Qwen-GRPO / vocab.json

alphadl's picture

Training in progress, step 500

e526f3e verified 2 months ago

history contribute delete

2.78 MB

File too large to display, you can check the raw version instead.