SilVar: Speech Driven Multimodal Model for Reasoning Visual Question Answering and Object Localization Paper • 2412.16771 • Published Dec 21, 2024
SilVar-Med: A Speech-Driven Visual Language Model for Explainable Abnormality Detection in Medical Imaging Paper • 2504.10642 • Published Apr 14
MultiMed-ST: Large-scale Many-to-many Multilingual Medical Speech Translation Paper • 2504.03546 • Published Apr 4
RainbowPlus: Enhancing Adversarial Prompt Generation via Evolutionary Quality-Diversity Search Paper • 2504.15047 • Published Apr 21