chenggong1995/openr1-Qwen-2.5-Base-3B-gen8-scale-NuminaMath-TIR-100-grpo-beta0-epoch2 Text Generation • 3B • Updated Apr 11
chenggong1995/Qwen-2.5-Base-3B-gen8-scale-MATH-lighteval-olympiads_aime-unique-ghpo-beta0-epoch3 Text Generation • 3B • Updated Apr 11 • 1
chenggong1995/Qwen-2.5-Base-7B-gen8-scale-MATH-lighteval-olympiads_aime-unique-ghpo-beta0-epoch3-NP 8B • Updated Apr 10
chenggong1995/Qwen-2.5-Base-3B-gen8-scale-math_selected-grpo-beta0-epoch3 Text Generation • 3B • Updated Apr 10
chenggong1995/Qwen-2.5-Base-7B-gen8-scale-MATH-lighteval-olympiads_aime-unique-grpo-beta0-epoch2 Text Generation • 8B • Updated Apr 9
chenggong1995/Qwen-2.5-Base-7B-gen8-scale-MATH-lighteval-olympiads_aime-unique-ghpo-beta0-epoch2 Text Generation • 8B • Updated Apr 9
chenggong1995/Qwen-2.5-Base-7B-gen8-scale-MATH-lighteval-olympiads_aime-unique-ghpo-beta1e-4-epoch2 8B • Updated Apr 9
chenggong1995/Qwen-2.5-Base-7B-gen8-scale-MATH-lighteval-olympiads_aime-ghpo Text Generation • 8B • Updated Apr 8
chenggong1995/Qwen-2.5-Base-7B-gen8-scale-MATH-lighteval-olympiads_aime-grpo Text Generation • 8B • Updated Apr 7
chenggong1995/Qwen-2.5-Base-7B-mixed-gen8-scale-ghpo-hint0.5-epoch1 Text Generation • 8B • Updated Apr 7 • 1
chenggong1995/Qwen-2.5-Base-7B-mixed-gen8-noscale-ghpo-hint0.5-epoch1 Text Generation • 8B • Updated Apr 5
chenggong1995/Qwen-2.5-Base-7B-mixed-gen8-scale-ghpo-hint0.4-epoch1 Text Generation • 8B • Updated Apr 2
chenggong1995/Qwen-2.5-Base-7B-mixed-gen8-scale-ghpo-hint0.6-epoch1 Text Generation • 8B • Updated Apr 2
chenggong1995/Qwen-2.5-Base-7B-mixed-gen8-scale-ghpo-hint0.9-epoch1 Text Generation • 8B • Updated Apr 2
chenggong1995/Qwen-2.5-Base-7B-mixed-gen8-scale-ghpo-hint0.3-epoch1 Text Generation • 8B • Updated Apr 1
chenggong1995/Qwen-2.5-Base-7B-mixed-gen8-scale-ghpo-hint0.7 Text Generation • 8B • Updated Apr 1 • 1
chenggong1995/Qwen-2.5-Base-7B-Zero-CL-gen8-om220k-hard-data8000-hint Text Generation • 8B • Updated Mar 23