WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models
Paper
• 2112.06598 • Published
• 1
gpt2 transferred to Ukrainian using the method from the NAACL2022 paper WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.