Language Surgery in Multilingual Large Language Models Paper • 2506.12450 • Published 12 days ago • 16
DistilWhisper: Efficient Distillation of Multi-task Speech Models via Language-Specific Experts Paper • 2311.01070 • Published Nov 2, 2023 • 2
What Do Compressed Multilingual Machine Translation Models Forget? Paper • 2205.10828 • Published May 22, 2022 • 1
SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages Paper • 2210.11621 • Published Oct 20, 2022 • 2
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model Paper • 2211.05100 • Published Nov 9, 2022 • 32
BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting Paper • 2212.09535 • Published Dec 19, 2022 • 1
Memory-efficient NLLB-200: Language-specific Expert Pruning of a Massively Multilingual Machine Translation Model Paper • 2212.09811 • Published Dec 19, 2022 • 1
BERGEN: A Benchmarking Library for Retrieval-Augmented Generation Paper • 2407.01102 • Published Jul 1, 2024
Provence: efficient and robust context pruning for retrieval-augmented generation Paper • 2501.16214 • Published Jan 27, 2025
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages Paper • 2406.10118 • Published Jun 14, 2024 • 33
Towards Understanding the Fragility of Multilingual LLMs against Fine-Tuning Attacks Paper • 2410.18210 • Published Oct 23, 2024
Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs Paper • 2502.12982 • Published Feb 18, 2025 • 18
Attention-Based LSTM for Psychological Stress Detection from Spoken Language Using Distant Supervision Paper • 1805.12307 • Published May 31, 2018
NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages Paper • 2309.10661 • Published Sep 19, 2023 • 1