VERIFY: A Benchmark of Visual Explanation and Reasoning for Investigating Multimodal Reasoning Fidelity Paper • 2503.11557 • Published 9 days ago • 19
SurveyX: Academic Survey Automation via Large Language Models Paper • 2502.14776 • Published about 1 month ago • 95