Math
GitBag/gemma-2-9b-it-gsm8k Viewer • Updated Nov 5, 2024 • 7.47k • 44
GitBag/llama-3_1-70b-it-gsm8k Viewer • Updated Nov 5, 2024 • 7.47k • 30
GitBag/gemma-2-27b-it-gsm8k Viewer • Updated Nov 5, 2024 • 7.47k • 35
GitBag/llama-3-8b-it-gsm8k Viewer • Updated Nov 5, 2024 • 7.47k • 29
GitBag/a_star_final_ds-distilled-qwen-1.5b-a-star-16384_actor Text Generation • Updated 16 days ago • 62
GitBag/a_star_final_ds-distilled-qwen-1.5b-grpo-2-kl-1e-4-16384_actor Text Generation • Updated 16 days ago • 58
GitBag/a_star_final_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-16384_critic Token Classification • Updated about 1 month ago • 15
GitBag/a_star_final_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-16384_actor Text Generation • Updated about 1 month ago • 63
GitBag/block-q-sharp_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-good-1_hmmt-feb-25_eval_new_1024 Viewer • Updated about 1 month ago • 30.7k • 59
GitBag/block-q-sharp_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-good-1_aime-24_eval_new_1024 Viewer • Updated about 1 month ago • 30.7k • 66
GitBag/block-q-sharp_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-good-1_hmmt-feb-24_eval_new_1024 Viewer • Updated about 1 month ago • 30.7k • 62
GitBag/block-q-sharp_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-good-1_aime-25_eval_new_1024 Viewer • Updated about 1 month ago • 30.7k • 62
GitBag/block-q-sharp_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-good-1_aime-25_eval_new_256 Viewer • Updated May 12 • 7.68k • 55
GitBag/block-q-sharp_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-good-1_hmmt-feb-25_eval_new_256 Viewer • Updated May 12 • 7.68k • 57
GitBag/block-q-sharp_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-good-1_aime-24_eval_new_256 Viewer • Updated May 12 • 7.68k • 57