Zero-shot Benchmarking: A Framework for Flexible and Scalable Automatic Evaluation of Language Models Paper • 2504.01001 • Published Apr 1 • 1
Tower+: Bridging Generality and Translation Specialization in Multilingual LLMs Paper • 2506.17080 • Published Jun 20 • 4
Is Preference Alignment Always the Best Option to Enhance LLM-Based Translation? An Empirical Analysis Paper • 2409.20059 • Published Sep 30, 2024 • 17
xTower: A Multilingual LLM for Explaining and Correcting Translation Errors Paper • 2406.19482 • Published Jun 27, 2024
WMT24++: Expanding the Language Coverage of WMT24 to 55 Languages & Dialects Paper • 2502.12404 • Published Feb 18
QUEST: Quality-Aware Metropolis-Hastings Sampling for Machine Translation Paper • 2406.00049 • Published May 28, 2024
EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published Mar 7 • 81
Tower: An Open Multilingual Large Language Model for Translation-Related Tasks Paper • 2402.17733 • Published Feb 27, 2024 • 7
CroissantLLM: A Truly Bilingual French-English Language Model Paper • 2402.00786 • Published Feb 1, 2024 • 27
AfriMTE and AfriCOMET: Empowering COMET to Embrace Under-resourced African Languages Paper • 2311.09828 • Published Nov 16, 2023 • 1
Disentangling Uncertainty in Machine Translation Evaluation Paper • 2204.06546 • Published Apr 13, 2022
xCOMET: Transparent Machine Translation Evaluation through Fine-grained Error Detection Paper • 2310.10482 • Published Oct 16, 2023 • 3
Steering Large Language Models for Machine Translation with Finetuning and In-Context Learning Paper • 2310.13448 • Published Oct 20, 2023 • 1
Scaling up COMETKIWI: Unbabel-IST 2023 Submission for the Quality Estimation Shared Task Paper • 2309.11925 • Published Sep 21, 2023
Unbabel's Participation in the WMT20 Metrics Shared Task Paper • 2010.15535 • Published Oct 29, 2020 • 2
CometKiwi: IST-Unbabel 2022 Submission for the Quality Estimation Shared Task Paper • 2209.06243 • Published Sep 13, 2022