Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Big Science - Modeling Metadata
non-profit
https://github.com/bigscience-workshop/metadata
Activity Feed
Request to join this org
Follow
23
AI & ML interests
None defined yet.
Recent Activity
stellaathena
authored
a paper
17 days ago
Emergent and Predictable Memorization in Large Language Models
stellaathena
authored
a paper
17 days ago
KMMLU: Measuring Massive Multitask Language Understanding in Korean
stellaathena
authored
a paper
17 days ago
Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence
View all activity
Team members
20
bs-modeling-metadata
's datasets
6
Sort: Recently updated
bs-modeling-metadata/c4-en-html-with-training_metadata_all
Viewer
•
Updated
Apr 1, 2023
•
33.3k
•
75
bs-modeling-metadata/c4-en-html-with-metadata
Viewer
•
Updated
Aug 18, 2022
•
44.6M
•
978
•
10
bs-modeling-metadata/website_metadata_c4
Viewer
•
Updated
Nov 24, 2021
•
52.6k
•
46
•
3
bs-modeling-metadata/wiki_dump
Updated
Nov 23, 2021
•
106
bs-modeling-metadata/c4_newslike_url_only
Viewer
•
Updated
Sep 20, 2021
•
13.8M
•
21
bs-modeling-metadata/OSCAR_Entity_13_000
Viewer
•
Updated
Sep 15, 2021
•
10.7k
•
25
•
1