chenggong1995/Qwen-2.5-Base-7B-Zero-CL-gen8-om220k-hard-data8000-v1 Text Generation • 8B • Updated Mar 21
chenggong1995/Qwen2.5-3B-Instruct-Distill-om220k-fem32768-batch32-epoch3-8192-grpo-E3 Text Generation • 3B • Updated Mar 18 • 1
chenggong1995/Qwen2.5-3B-Instruct-Distill-bs17k-batch32-epoch3-8192-grpo-E3 Text Generation • 3B • Updated Mar 17 • 4