MaLA corpus Collection MaLA Corpus for Massive Language Adaptation of Large Language Models • 17 items • Updated 1 day ago • 6
MaLA corpus Collection MaLA Corpus for Massive Language Adaptation of Large Language Models • 17 items • Updated 1 day ago • 6
EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models Paper • 2409.17892 • Published Sep 26, 2024 • 2
ObscuraCoder: Powering Efficient Code LM Pre-Training Via Obfuscation Grounding Paper • 2504.00019 • Published Mar 27
IRCoder: Intermediate Representations Make Language Models Robust Multilingual Code Generators Paper • 2403.03894 • Published Mar 6, 2024
An Expanded Massive Multilingual Dataset for High-Performance Language Technologies Paper • 2503.10267 • Published Mar 13
EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models Paper • 2409.17892 • Published Sep 26, 2024 • 2
An Expanded Massive Multilingual Dataset for High-Performance Language Technologies Paper • 2503.10267 • Published Mar 13
EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models Paper • 2409.17892 • Published Sep 26, 2024 • 2
The Highs and Lows of Simple Lexical Domain Adaptation Approaches for Neural Machine Translation Paper • 2101.00421 • Published Jan 2, 2021
To Adapt or to Fine-tune: A Case Study on Abstractive Summarization Paper • 2208.14559 • Published Aug 30, 2022
A Unified Model for Reverse Dictionary and Definition Modelling Paper • 2205.04602 • Published May 9, 2022
Approaching Neural Chinese Word Segmentation as a Low-Resource Machine Translation Task Paper • 2008.05348 • Published Aug 12, 2020