Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Sailor2
community
Activity Feed
Request to join this org
Follow
147
AI & ML interests
Open language models for South-East Asia
Recent Activity
Cameron-Chen
authored
a paper
9 days ago
GEM: A Gym for Agentic LLMs
afaji
authored
a paper
about 1 month ago
Predicting the Order of Upcoming Tokens Improves Language Modeling
wannaphong
authored
a paper
2 months ago
Mangosteen: An Open Thai Corpus for Language Model Pretraining
View all activity
Team members
117
+83
+70
+49
+39
+19
sailor2
's datasets
16
Sort: Recently updated
sailor2/sea-wildbench
Viewer
•
Updated
Mar 26
•
1.02k
•
27
sailor2/sea-ultrafeedback-onpolicy
Viewer
•
Updated
Feb 16
•
38.3k
•
8
sailor2/Flores-Plus-Evaluation-Log-Preview-Cleaned
Viewer
•
Updated
Jan 22
•
153k
•
10
sailor2/sea-pdf-text
Viewer
•
Updated
Dec 4, 2024
•
32.4M
•
276
•
1
sailor2/sea-internet
Viewer
•
Updated
Dec 4, 2024
•
14.2M
•
184
•
1
sailor2/sailor2-sft-stage1
Viewer
•
Updated
Dec 4, 2024
•
2.73M
•
33
sailor2/sea-commoncrawl
Viewer
•
Updated
Dec 4, 2024
•
494M
•
2.22k
sailor2/sailor2-pretrain-data-stage2
Viewer
•
Updated
Dec 4, 2024
•
51.7M
•
106
sailor2/sailor2-pretrain-data-stage1
Viewer
•
Updated
Dec 4, 2024
•
295M
•
3.03k
sailor2/community-dataset
Viewer
•
Updated
Dec 4, 2024
•
5.17M
•
172
•
1
sailor2/sailor2-sft-stage2
Viewer
•
Updated
Dec 2, 2024
•
531k
•
16
sailor2/sea-ultrafeedback
Viewer
•
Updated
Nov 16, 2024
•
58.5k
•
56
sailor2/sea-commoncrawl-high-quality
Viewer
•
Updated
Nov 1, 2024
•
17.4M
•
648
sailor2/sea-synthetic
Viewer
•
Updated
Oct 30, 2024
•
59.8M
•
5.19k
sailor2/Vietnamese_RAG
Viewer
•
Updated
Jul 16, 2024
•
8.41k
•
42
•
9
sailor2/xcopa
Preview
•
Updated
Jul 2, 2024
•
20