Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
2
1
1
Catherine Arnett
catherinearnett
Follow
pranavs6's profile picture
lunarflu's profile picture
21world's profile picture
15 followers
·
1 following
https://catherinearnett.github.io/
linguist_cat
catherinearnett
AI & ML interests
multilingual NLP, tokenization
Articles
Releasing the largest multilingual open pretraining dataset
4 days ago
•
88
Detoxifying the Commons
17 days ago
•
6
wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR??
Sep 27
•
35
Organizations
catherinearnett
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a dataset
5 months ago
ambean/lingOly
Viewer
•
Updated
Jun 11
•
90
•
211
•
7