Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models Paper • 2503.21380 • Published 13 days ago • 36
ETVA: Evaluation of Text-to-Video Alignment via Fine-grained Question Generation and Answering Paper • 2503.16867 • Published 19 days ago • 11
The Dawn After the Dark: An Empirical Study on Factuality Hallucination in Large Language Models Paper • 2401.03205 • Published Jan 6, 2024
Towards Effective and Efficient Continual Pre-training of Large Language Models Paper • 2407.18743 • Published Jul 26, 2024
Technical Report: Enhancing LLM Reasoning with Reward-guided Tree Search Paper • 2411.11694 • Published Nov 18, 2024
Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems Paper • 2412.09413 • Published Dec 12, 2024 • 1
An Empirical Study on Eliciting and Improving R1-like Reasoning Models Paper • 2503.04548 • Published Mar 6 • 8
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning Paper • 2503.05592 • Published Mar 7 • 25
An Empirical Study on Eliciting and Improving R1-like Reasoning Models Paper • 2503.04548 • Published Mar 6 • 8
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models Paper • 2501.03262 • Published Jan 4 • 99