Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Big Science - Modeling Metadata

non-profit
https://github.com/bigscience-workshop/metadata
Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

stellaathena  authored a paper 17 days ago
Emergent and Predictable Memorization in Large Language Models
stellaathena  authored a paper 17 days ago
KMMLU: Measuring Massive Multitask Language Understanding in Korean
stellaathena  authored a paper 17 days ago
Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence
View all activity

Timo Schick's profile picture Nora Kassner's profile picture Lucile Saulnier's profile picture Shanya Sharma's profile picture Christopher's profile picture Mike Tian-Jian Jiang's profile picture gerard dupont's profile picture Stella Biderman's profile picture Machine User Bs Metadata WG's profile picture Hugo Laurençon's profile picture Victor Sanh's profile picture Leo Tronchon's profile picture Paul Pommer's profile picture Masoud Jalili Sabet's profile picture Niklas Muennighoff's profile picture Jordan Clive's profile picture Manan Dey's profile picture M Saiful Bari's profile picture Jonathan Chang's profile picture vumichien's profile picture

bs-modeling-metadata 's datasets 6

bs-modeling-metadata/c4-en-html-with-training_metadata_all

Viewer • Updated Apr 1, 2023 • 33.3k • 75

bs-modeling-metadata/c4-en-html-with-metadata

Viewer • Updated Aug 18, 2022 • 44.6M • 978 • 10

bs-modeling-metadata/website_metadata_c4

Viewer • Updated Nov 24, 2021 • 52.6k • 46 • 3

bs-modeling-metadata/wiki_dump

Updated Nov 23, 2021 • 106

bs-modeling-metadata/c4_newslike_url_only

Viewer • Updated Sep 20, 2021 • 13.8M • 21

bs-modeling-metadata/OSCAR_Entity_13_000

Viewer • Updated Sep 15, 2021 • 10.7k • 25 • 1
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs