More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models Paper • 2505.21523 • Published 27 days ago • 14
Reducing Hallucinations in Vision-Language Models via Latent Space Steering Paper • 2410.15778 • Published Oct 21, 2024 • 1
OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning Paper • 2502.11271 • Published Feb 16 • 18
TextGrad: Automatic "Differentiation" via Text Paper • 2406.07496 • Published Jun 11, 2024 • 32
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine Paper • 2408.02900 • Published Aug 6, 2024 • 31