Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning • Paper • 2505.01441 • Published 12 days ago • 32
WebThinker: Empowering Large Reasoning Models with Deep Research Capability • Paper • 2504.21776 • Published 9 days ago • 43
Reinforcement Learning for Reasoning in Large Language Models with One Training Example • Paper • 2504.20571 • Published 11 days ago • 90
NodeRAG: Structuring Graph-based RAG with Heterogeneous Nodes • Paper • 2504.11544 • Published 24 days ago • 42
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations • Paper • 2504.10481 • Published 25 days ago • 84