chenggong1995/Qwen2.5-Math-7B-gen8-math3to5_olympiads_aime-ghpo-cold0-3Dhint-prompt1-epoch5-new44 Text Generation • 8B • Updated May 21 • 8
chenggong1995/Qwen2.5-Math-7B-gen8-math3to5_olympiads_aime-grpo-beta0-epoch5-new44 Text Generation • 8B • Updated May 20 • 7
chenggong1995/Qwen2.5-Math-7B-gen8-math3to5-ghpo-cold0-3Dhint-prompt1-epoch1 Text Generation • 8B • Updated May 19 • 9
chenggong1995/Qwen2.5-Math-7B-gen8-math3to5-grpo-beta0-epoch1 Text Generation • 8B • Updated May 19 • 9
chenggong1995/Qwen-2.5-Base-7B-gen8-mix_hint50-grpo-CL-beta0-epoch1-v2 Text Generation • 8B • Updated May 16 • 6
chenggong1995/Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-cold0-hint50-prompt1-redonum-test Text Generation • 8B • Updated May 16 • 7
chenggong1995/Qwen-2.5-Base-7B-gen8-mix_hint50-grpo-CL-beta1e-3-epoch1-v2 Text Generation • 8B • Updated May 15 • 6
chenggong1995/Qwen-2.5-Base-7B-gen8-mix-grpo-CL-beta1e-3-epoch1-v2 Text Generation • 8B • Updated May 14 • 7
chenggong1995/Qwen-2.5-Base-7B-gen8-mix_hint50-grpo-CL-beta1e-3-epoch1 Text Generation • 8B • Updated May 14 • 6