Hugging Face
BigScience Catalogue Data
non-profit
https://bigscience.huggingface.co
Followers: 56
Recent Activity
lvwerra authored a paper 12 days ago: FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
loubnabnl authored a paper about 1 month ago: The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text
stellaathena authored a paper about 1 month ago: Emergent and Predictable Memorization in Large Language Models
Team members: 45 (+11)
Models: 0 (none public yet)
Datasets: 2
bigscience-catalogue-data/shades_nationality • Updated Oct 9, 2024 • 35.5k • 324 • 4
bigscience-catalogue-data/bias-shades • Updated May 1, 2022 • 12 • 5