GenX: Mastering Code and Test Generation with Execution Feedback Paper • 2412.13464 • Published Dec 18, 2024 • 1
Improved Visual-Spatial Reasoning via R1-Zero-Like Training Paper • 2504.00883 • Published Apr 2025 • 62
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22, 2025 • 386