Token-level and sequence-level loss smoothing for RNN language models Paper • 1805.05062 • Published May 14, 2018
Efficient Wait-k Models for Simultaneous Machine Translation Paper • 2005.08595 • Published May 18, 2020
Added Toxicity Mitigation at Inference Time for Multimodal and Massively Multilingual Translation Paper • 2311.06532 • Published Nov 11, 2023
Large Concept Models: Language Modeling in a Sentence Representation Space Paper • 2412.08821 • Published Dec 11, 2024
SpiRit-LM: Interleaved Spoken and Written Language Model Paper • 2402.05755 • Published Feb 8, 2024
Seamless: Multilingual Expressive and Streaming Speech Translation Paper • 2312.05187 • Published Dec 8, 2023
No Language Left Behind: Scaling Human-Centered Machine Translation Paper • 2207.04672 • Published Jul 11, 2022
Pervasive Attention: 2D Convolutional Neural Networks for Sequence-to-Sequence Prediction Paper • 1808.03867 • Published Aug 11, 2018
SeamlessM4T-Massively Multilingual & Multimodal Machine Translation Paper • 2308.11596 • Published Aug 22, 2023
Causes and Cures for Interference in Multilingual Translation Paper • 2212.07530 • Published Dec 14, 2022
Towards Being Parameter-Efficient: A Stratified Sparsely Activated Transformer with Dynamic Capacity Paper • 2305.02176 • Published May 3, 2023
Efficiently Upgrading Multilingual Machine Translation Models to Support More Languages Paper • 2302.03528 • Published Feb 7, 2023
Fixing MoE Over-Fitting on Low-Resource Languages in Multilingual Machine Translation Paper • 2212.07571 • Published Dec 15, 2022