chenggong1995/Qwen-2.5-Base-7B-gen8-scale-MATH-lighteval-olympiads_aime-grpo Text Generation • Updated 8 days ago • 124
chenggong1995/Qwen-2.5-Base-7B-gen8-scale-MATH-lighteval-olympiads_aime-ghpo Text Generation • Updated 8 days ago • 152
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-olympiads-aime-unique-cosine Text Generation • Updated 8 days ago • 154
chenggong1995/Qwen-2.5-Base-7B-gen8-scale-MATH-lighteval-olympiads_aime-unique-ghpo-beta0-epoch2 Text Generation • Updated 7 days ago • 220
chenggong1995/Qwen-2.5-Base-7B-gen8-scale-MATH-lighteval-olympiads_aime-unique-grpo-beta0-epoch2 Text Generation • Updated 6 days ago • 47
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-olympiads-aime-unique-cosine-v4 Text Generation • Updated 6 days ago • 1
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-olympiads-aime-unique-cosine-only Text Generation • Updated 6 days ago • 4
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-olympiads-aime-unique-cosine-noRW-noformat Text Generation • Updated 2 days ago • 42