Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
BigScience Catalogue Data
non-profit
https://bigscience.huggingface.co
Activity Feed
Request to join this org
Follow
56
AI & ML interests
None defined yet.
Recent Activity
lvwerra
authored
a paper
13 days ago
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
loubnabnl
authored
a paper
about 1 month ago
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text
stellaathena
authored
a paper
about 1 month ago
Emergent and Predictable Memorization in Large Language Models
View all activity
Team members
45
+11
bigscience-catalogue-data
's models
None public yet