arxiv:2506.09958
Sushant Gautam
SushantGautam
AI & ML interests
multimodal, deep learning
Recent Activity
authored a paper 1 day ago
Kvasir-VQA-x1: A Multimodal Dataset for Medical Reasoning and Robust MedVQA in Gastrointestinal Endoscopy
authored a paper 1 day ago
Point, Detect, Count: Multi-Task Medical Image Understanding with Instruction-Tuned Vision-Language Models
authored a paper 1 day ago
SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game Understanding