- From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review
  Paper • 2504.19678 • Published • 3
- DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
  Paper • 2509.25454 • Published • 133
- Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation
  Paper • 2509.25849 • Published • 46
Lee PRO
Chanhui
AI & ML interests
None yet
Recent Activity
liked a model about 1 hour ago: futurehouse/ether0
updated a collection 21 days ago: LLM-Reasoning