Catherine Arnett

catherinearnett

AI & ML interests

multilingual NLP, tokenization

Articles

Organizations

catherinearnett's activity

upvoted an article 4 days ago
view article
Article

Releasing the largest multilingual open pretraining dataset

88