chenggong1995
/

Qwen-2.5-Base-7B-gen8-mix_hint50-grpo-CL-beta1e-3-epoch1-v2

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions Community

Qwen-2.5-Base-7B-gen8-mix_hint50-grpo-CL-beta1e-3-epoch1-v2 / vocab.json

chenggong1995's picture

Training in progress, epoch 0

51dd17c verified 2 months ago

history contribute delete

2.78 MB

File too large to display, you can check the raw version instead.