Skill Discovery for Software Scripting Automation via Offline Simulations with LLMs • Paper • 2504.20406 • Published 13 days ago • 6
AdaR1: From Long-CoT to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization • Paper • 2504.21659 • Published 12 days ago • 11
LLMs for Engineering: Teaching Models to Design High Powered Rockets • Paper • 2504.19394 • Published 14 days ago • 13
Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks • Paper • 2505.00234 • Published 11 days ago • 22
DeepCritic: Deliberate Critique with Large Language Models • Paper • 2505.00662 • Published 10 days ago • 48
WebThinker: Empowering Large Reasoning Models with Deep Research Capability • Paper • 2504.21776 • Published 12 days ago • 44
OpenMathReasoning • Collection • Models and datasets from "AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset" • 7 items • Updated 2 days ago • 36