sergiopaniego/Qwen2-0-5B-GRPO-vllm-trl

Generated from Trainer

Qwen2-0-5B-GRPO-vllm-trl / merges.txt

sergiopaniego HF Staff

Training in progress, step 10

31a052a verified about 2 months ago

1.67 MB

File too large to display.