Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
85
11
15
Guilherme Penedo
guipenedo
Follow
CyberHug's profile picture
ahmadreza1323's profile picture
nndr72's profile picture
787 followers
·
6 following
gui_penedo
guipenedo
AI & ML interests
None yet
Articles
FineWeb2-C: Help Build Better Language Models in Your Language
26 days ago
•
13
Organizations
guipenedo
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a dataset
about 1 month ago
HuggingFaceFW/fineweb-2
Viewer
•
Updated
10 days ago
•
12.5B
•
62.1k
•
392
liked
a Space
about 2 months ago
Runtime error
34
💬
Discussion Forum
liked
a model
3 months ago
HuggingFaceTB/SmolLM2-1.7B-Instruct
Text Generation
•
Updated
11 days ago
•
65.6k
•
472
liked
2 Spaces
3 months ago
Running
53
📝
Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks
Running
95
📖
TxT360: Trillion Extracted Text
liked
a model
4 months ago
cis-lmu/glotlid
Text Classification
•
Updated
Oct 26, 2024
•
7.35k
•
54
liked
a dataset
4 months ago
tiiuae/falcon-refinedweb
Viewer
•
Updated
Jun 20, 2023
•
968M
•
20.5k
•
826
liked
a Space
6 months ago
Running
371
🧽
Finegrain Object Eraser
Erase any object just by naming it!
liked
3 models
6 months ago
HuggingFaceTB/SmolLM-1.7B
Text Generation
•
Updated
Oct 16, 2024
•
12.7k
•
165
HuggingFaceTB/SmolLM-1.7B-Instruct
Text Generation
•
Updated
Aug 18, 2024
•
47.4k
•
107
AI-MO/NuminaMath-7B-TIR
Text Generation
•
Updated
Aug 14, 2024
•
2.66k
•
327
liked
a dataset
8 months ago
HuggingFaceFW/fineweb-edu
Viewer
•
Updated
12 days ago
•
3.24B
•
181k
•
597
liked
a Space
8 months ago
Running
555
🍷
FineWeb: decanting the web for the finest text data at scale
liked
a dataset
9 months ago
HuggingFaceFW/fineweb
Viewer
•
Updated
15 days ago
•
48.6B
•
269k
•
1.83k
liked
a Space
about 1 year ago
Running
206
🚀
GPT Baker