SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages Paper • 2406.10118 • Published Jun 14, 2024 • 32
All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages Paper • 2411.16508 • Published Nov 25, 2024 • 12
Dialectal Coverage And Generalization in Arabic Speech Recognition Paper • 2411.05872 • Published Nov 7, 2024 • 2
SparQLe: Speech Queries to Text Translation Through LLMs Paper • 2502.09284 • Published Feb 13 • 1
NADI 2025: The First Multidialectal Arabic Speech Processing Shared Task Paper • 2509.02038 • Published 22 days ago
SPIRIT: Patching Speech Language Models against Jailbreak Attacks Paper • 2505.13541 • Published May 18 • 2
NADI 2025 Sub-task 3 datasets Collection Official training and dev datasets for NADI 2025 Subtask 3 (Diacritic Restoration) Shared Task • 10 items • Updated Jul 21