How to Train Your LLM Web Agent: A Statistical Diagnosis Paper • 2507.04103 • Published 13 days ago • 46
PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation Paper • 2310.14192 • Published Oct 22, 2023 • 2
LitLLMs, LLMs for Literature Review: Are we there yet? Paper • 2412.15249 • Published Dec 15, 2024 • 2
view article Article Open-R1: a fully open reproduction of DeepSeek-R1 By eliebak and 2 others • Jan 28 • 876