LLaDA-MedV: Exploring Large Language Diffusion Models for Biomedical Image Understanding Paper • 2508.01617 • Published Aug 3
Reasoning Models Can be Accurately Pruned Via Chain-of-Thought Reconstruction Paper • 2509.12464 • Published Sep 15
Planner-R1: Reward Shaping Enables Efficient Agentic RL with Smaller LLMs Paper • 2509.25779 • Published 19 days ago • 16