Agent tuning
  zai-org/SWE-Dev-train • Updated Jul 9 • 20.1k • 372 • 6
  SWE-Gym/OpenHands-SFT-Trajectories • Updated May 10 • 491 • 224 • 13
  lmarena-ai/webdev-arena-preference-10k • Updated Mar 10 • 10.5k • 256 • 12
  SWE-bench/SWE-smith-trajectories • Updated Jul 19 • 76k • 1.42k • 24
Smol Agents papers
  Distilling LLM Agent into Small Models with Retrieval and Code Tools • Paper • 2505.17612 • Published May 23 • 81
Agent Benchmarks
  xw27/scibench • Updated May 6, 2024 • 692 • 733 • 19
  google/frames-benchmark • Updated Oct 15, 2024 • 824 • 2.84k • 223
  gaia-benchmark/GAIA • Updated Feb 13 • 14.5k • 436
  HuggingFaceH4/MATH-500 • Updated Nov 15, 2024 • 500 • 91.7k • 180