u-10bei/llm-jp-3-13b-instruct2-chat-GSM8K-math2.0-cot2-grpo2-merged Text Generation • 14B • Updated Apr 4 • 3
u-10bei/llm-jp-3-13b-instruct2-chat-sft1_2-grpo2_2-2000-merged Text Generation • 14B • Updated Mar 24 • 3
u-10bei/llm-jp-3-13b-instruct2-chat-sft1_2-grpo2_2-1500-merged Text Generation • 14B • Updated Mar 24 • 3
u-10bei/llm-jp-3-13b-instruct2-chat-sft1_2-grpo2_2-1000-merged Text Generation • 14B • Updated Mar 24 • 3
u-10bei/llm-jp-3-13b-instruct2-chat-sft1_2-grpo2_2-500-merged Text Generation • 14B • Updated Mar 24 • 3
u-10bei/llm-jp-3-13b-instruct2-gpro-0222_OpenMATHinstruct_1800_sft Text Generation • 14B • Updated Mar 1 • 2
u-10bei/llm-jp-3-13b-instruct2-grpo-MATH-lighteval_step1000 Text Generation • 14B • Updated Mar 1 • 2
u-10bei/llm-jp-3-13b-instruct2-grpo-0222_lora_step2000_ja5000 Text Generation • 14B • Updated Feb 26 • 2
u-10bei/llm-jp-3-13b-instruct2-grpo-0222_lora_step2000_ja2000 Text Generation • 14B • Updated Feb 26 • 4