Open Datasets
239 downloads • 86 likes
fka/awesome-chatgpt-prompts • Viewer • 1.11k rows • 17.5k downloads • 9.58k likes
Viewer • 470M rows • 36.6k downloads • 334 likes
Viewer • 2.2M rows • 4.96k downloads • 387 likes
Matthijs/cmu-arctic-xvectors • Viewer • 7.93k rows • 18.7k downloads • 62 likes
parler-tts/libritts-r-filtered-speaker-descriptions • Viewer • 359k rows • 93 downloads • 7 likes
Viewer • 860k rows • 10.7k downloads • 531 likes
alpindale/two-million-bluesky-posts • Viewer • 2.11M rows • 525 downloads • 200 likes
arimalabs/2.3-million-bluesky-posts • Viewer • 2.37M rows • 23 downloads • 5 likes
Viewer • 70k rows • 69.6k downloads • 223 likes
Viewer • 1.34M rows • 1.86k downloads • 30 likes
Viewer • 1.12M rows • 685 downloads • 4 likes
parler-tts/libritts_r_filtered • Viewer • 359k rows • 2.29k downloads • 21 likes
opendiffusionai/cc12m-cleaned • Viewer • 8.53M rows • 378 downloads • 10 likes
Viewer • 31.4k rows • 429 downloads • 22 likes
Preview • 99 downloads • 7 likes
Viewer • 61.6M rows • 80.2k downloads • 1.13k likes
parler-tts/mls-eng-speaker-descriptions • Viewer • 10.8M rows • 60 downloads • 10 likes
Viewer • 111M rows • 623 downloads • 99 likes
21 downloads • 2 likes
Viewer • 602k rows • 11.6k downloads • 144 likes
Viewer • 4.48B rows • 109k downloads • 729 likes
Viewer • 1.55k rows • 10 downloads • 4 likes
6.5k downloads • 139 likes
Viewer • 59.1k rows • 328 downloads • 12 likes
keremberke/license-plate-object-detection • Viewer • 8.83k rows • 774 downloads • 34 likes
38 downloads • 8 likes
Viewer • 98.6k rows • 2.81k downloads • 100 likes
nebius/SWE-agent-trajectories • Viewer • 80k rows • 353 downloads • 67 likes
Viewer • 3.4k rows • 2.36k downloads • 56 likes
cfahlgren1/react-code-instructions • Viewer • 74.4k rows • 141 downloads • 156 likes
DAMO-NLP-SG/multimodal_textbook • 743 downloads • 156 likes
NovaSky-AI/Sky-T1_data_17k • Viewer • 16.4k rows • 121 downloads • 187 likes
Viewer • 5.45B rows • 5.43k downloads • 449 likes
Viewer • 546M rows • 16.3k downloads • 938 likes
hoskinson-center/proof-pile • Viewer • 363k rows • 869 downloads • 63 likes
HuggingFaceFW/fineweb-edu • Viewer • 3.5B rows • 348k downloads • 922 likes
EleutherAI/the_pile_deduplicated • Viewer • 134M rows • 9.73k downloads • 107 likes
MohamedRashad/multilingual-tts • Viewer • 25.5k rows • 73 downloads • 47 likes
Viewer • 16.4k rows • 7 downloads • 4 likes
facebook/multilingual_librispeech • Viewer • 1.49M rows • 7.66k downloads • 167 likes
Viewer • 1.25M rows • 11.1k downloads • 85 likes
Viewer • 2.77M rows • 3.8k downloads • 113 likes
Fumika/Wikinews-multilingual • Viewer • 15.2k rows • 58 downloads • 7 likes
ayymen/Weblate-Translations • Viewer • 11.7M rows • 882 downloads • 16 likes
220k downloads • 153 likes
Helsinki-NLP/opus_wikipedia • Viewer • 1.75M rows • 77 downloads • 10 likes
Viewer • 3.59M rows • 30 downloads • 1 like
MLCommons/unsupervised_peoples_speech • 26.3k downloads • 69 likes
HKUSTAudio/Llasa_opensource_speech_data_160k_hours_tokenized • 89 downloads • 30 likes
Viewer • 10k rows • 4.25k downloads • 529 likes
Viewer • 68.1k rows • 61.9k downloads • 21 likes
allenai/RLVR-GSM-MATH-IF-Mixed-Constraints • Viewer • 29.9k rows • 1.32k downloads • 30 likes
allenai/olmo-2-0325-32b-preference-mix • 150 downloads • 15 likes
allenai/tulu-3-sft-olmo-2-mixture-0225 • Viewer • 866k rows • 1.01k downloads • 22 likes
Viewer • 170M rows • 46.4k downloads • 90 likes
Viewer • 621M rows • 20.4k downloads • 86 likes
Viewer • 932 rows • 15.5k downloads • 601 likes
Congliu/Chinese-DeepSeek-R1-Distill-data-110k • Viewer • 110k rows • 411 downloads • 722 likes
Viewer • 102k rows • 276 downloads • 47 likes
Viewer • 450k rows • 13.7k downloads • 697 likes
Viewer • 167M rows • 2.72k downloads • 63 likes