Towards Understanding the Robustness of Sparse Autoencoders Paper • 2604.18756 • Published 20 days ago • 10
When Background Matters: Breaking Medical Vision Language Models by Transferable Attack Paper • 2604.17318 • Published 21 days ago • 3
RadAgent: A tool-using AI agent for stepwise interpretation of chest computed tomography Paper • 2604.15231 • Published 24 days ago • 6
CarePilot: A Multi-Agent Framework for Long-Horizon Computer Task Automation in Healthcare Paper • 2603.24157 • Published Mar 25 • 10
CURE-Med: Curriculum-Informed Reinforcement Learning for Multilingual Medical Reasoning Paper • 2601.13262 • Published Jan 19 • 3
CLINIC: Evaluating Multilingual Trustworthiness in Language Models for Healthcare Paper • 2512.11437 • Published Dec 12, 2025 • 4
left|,circlearrowright,text{BUS},right|: A Large and Diverse Multimodal Benchmark for evaluating the ability of Vision-Language Models to understand Rebus Puzzles Paper • 2511.01340 • Published Nov 3, 2025 • 13
M3Retrieve: Benchmarking Multimodal Retrieval for Medicine Paper • 2510.06888 • Published Oct 8, 2025 • 4
Leveraging Large Language Models for Predictive Analysis of Human Misery Paper • 2508.12669 • Published Aug 18, 2025 • 14
Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens Paper • 2506.17218 • Published Jun 20, 2025 • 29