Big Science - Modeling Metadata

non-profit
https://github.com/bigscience-workshop/metadata

Recent Activity

vumichien authored a paper 8 days ago
MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources
vumichien authored a paper 8 days ago
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution
HugoLaurencon authored a paper 29 days ago
ARE: Scaling Up Agent Environments and Evaluations

Team members

Victor Sanh, gerard dupont, Niklas Muennighoff, Jonathan Chang, Christopher, Manan Dey, Stella Biderman, vumichien, Lucile Saulnier, M Saiful Bari, Mike Tian-Jian Jiang, Shanya Sharma, Jordan Clive, Timo Schick, Nora Kassner, Hugo Laurençon, Machine User Bs Metadata WG, Leo Tronchon, Paul Pommer, Masoud Jalili Sabet

bs-modeling-metadata's datasets (6)

bs-modeling-metadata/c4-en-html-with-training_metadata_all

Viewer • Updated Apr 1, 2023 • 33.3k • 517

bs-modeling-metadata/c4-en-html-with-metadata

Viewer • Updated Aug 18, 2022 • 44.6M • 452 • 10

bs-modeling-metadata/website_metadata_c4

Viewer • Updated Nov 24, 2021 • 52.6k • 43 • 3

bs-modeling-metadata/wiki_dump

Updated Nov 23, 2021 • 23

bs-modeling-metadata/c4_newslike_url_only

Viewer • Updated Sep 20, 2021 • 13.8M • 59

bs-modeling-metadata/OSCAR_Entity_13_000

Viewer • Updated Sep 15, 2021 • 10.7k • 20 • 1