ACECODER: Acing Coder RL via Automated Test-Case Synthesis • Paper • 2502.01718 • Published 18 days ago • 28
ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning • Paper • 2502.01100 • Published 19 days ago • 15
HALoGEN: Fantastic LLM Hallucinations and Where to Find Them • Paper • 2501.08292 • Published Jan 14 • 17