EmoAgent: Assessing and Safeguarding Human-AI Interaction for Mental Health Safety Paper • 2504.09689 • Published Apr 13 • 7
Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion Paper • 2504.11447 • Published Apr 15 • 5
Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning Paper • 2505.16270 • Published May 22 • 6
On Path to Multimodal Historical Reasoning: HistBench and HistAgent Paper • 2505.20246 • Published May 26
Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution Paper • 2505.20286 • Published May 26 • 7
Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning Paper • 2506.03136 • Published Jun 3 • 24
Toward Scientific Reasoning in LLMs: Training from Expert Discussions via Reinforcement Learning Paper • 2505.19501 • Published May 26 • 1