Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
EssentialAI
's Collections
Essential-Web v1.0
Rethinking Reflection in Pre-Training
Essential-Web v1.0
updated
Jun 18
Upvote
8
Essential-Web v1.0: 24T tokens of organized web data
Paper
•
2506.14111
•
Published
Jun 17
•
46
EssentialAI/essential-web-v1.0
Preview
•
Updated
23 days ago
•
89.9k
•
204
EssentialAI/eai-distill-0.5b
0.6B
•
Updated
Jun 18
•
558
•
23
EssentialAI/eai-taxonomy-math-w-fm
Viewer
•
Updated
Jun 22
•
21.6M
•
457
•
5
EssentialAI/eai-taxonomy-code-w-dclm
Viewer
•
Updated
Jun 22
•
274M
•
4.42k
•
8
EssentialAI/eai-taxonomy-code-w-dclm-100b-sample
Viewer
•
Updated
Jun 22
•
46.2M
•
252
•
2
EssentialAI/eai-taxonomy-med-w-dclm
Viewer
•
Updated
Jun 22
•
81.2M
•
332
•
8
EssentialAI/eai-taxonomy-med-w-dclm-100b-sample
Viewer
•
Updated
Jun 22
•
36.6M
•
215
•
2
EssentialAI/eai-taxonomy-stem-w-dclm
Preview
•
Updated
Jun 22
•
550
•
5
EssentialAI/eai-taxonomy-stem-w-dclm-100b-sample
Viewer
•
Updated
Jun 22
•
35.5M
•
357
•
4
Upvote
8
+4
Share collection
View history
Collection guide
Browse collections