chenggong1995/openr1-Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-grpo-epoch3 Text Generation • Updated 2 days ago • 2
weizhepei/Qwen2.5-3B-WebArena-Lite-SFT-CoT-QwQ-32B-epoch-1-no-sys-new Text Generation • Updated 2 days ago • 5
flyingbugs/Qwen2.5-Math-7B-Instruct-Math220k-correctness-0.2 Text Generation • Updated about 23 hours ago
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-olympiads-aime-unique-cosine-noRW-noformat Text Generation • Updated about 11 hours ago
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SpecReasoner_SFT_14k Text Generation • Updated about 9 hours ago