Med-REFL: Medical Reasoning Enhancement via Self-Corrected Fine-grained Reflection Paper • 2506.13793 • Published Jun 11 • 6
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning Paper • 2506.07044 • Published Jun 8 • 112
DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published Mar 18 • 137