whr94621's Collections
llm_datasets_multi
Updated May 15
Datasets that cover multiple languages at once
SEACrowd/x_fact • Updated Jun 24 • 65 • 1
juletxara/xstory_cloze • Viewer • Updated May 21, 2023 • 20.6k • 662 • 9
juletxara/xstory_cloze_mt • Updated Jul 21, 2023 • 824
miracl/nomiracl • Updated Feb 26 • 262 • 10
ayymen/Pontoon-Translations • Viewer • Updated Jan 19 • 3.56M • 1.48k • 11
biglam/europeana_newspapers • Viewer • Updated Jan 31 • 5.94M • 1.24k • 41
PleIAs/French-PD-Newspapers • Viewer • Updated Mar 19 • 2.25M • 744 • 61
ontocord/CulturaY • Viewer • Updated Mar 30 • 33.2M • 1.46k • 27
Shitao/MLDR • Updated Feb 6 • 1.16k • 58
joelniklaus/Multi_Legal_Pile_Commercial • Updated Oct 18, 2023 • 40 • 8
joelniklaus/eurlex_resources • Updated May 10, 2023 • 326 • 8
CohereForAI/c4ai-command-r-v01 • Text Generation • Updated Sep 27 • 7.41k • 1.07k
carolina-c4ai/corpus-carolina • Updated 25 days ago • 411 • 21
eduagarcia/LegalPT_dedup • Viewer • Updated May 7 • 23.9M • 1.78k • 15
PleIAs/YouTube-Commons • Updated Jun 26 • 813 • 318