common-dataset
HuggingFaceH4/ultrachat_200k • 515k rows • 43.8k downloads • 689 likes
• Text Generation • 7B params • 3.75k downloads • 320 likes
shareAI/ShareGPT-Chinese-English-90k • 1.11k downloads • 278 likes
• 207M rows • 28.7k downloads • 494 likes
lmsys/chatbot_arena_conversations • 33k rows • 44.7k downloads • 453 likes
• 968M rows • 42.3k downloads • 904 likes
WizardLMTeam/WizardLM_evol_instruct_70k • 70k rows • 1.35k downloads • 196 likes
LargeWorldModel/LWM-Text-Chat-1M • Text Generation • 1.06k downloads • 174 likes
• 1.11k downloads • 123 likes
microsoft/orca-math-word-problems-200k • 200k rows • 11.8k downloads • 479 likes
• 75 downloads • 27 likes
• 52.5B rows • 628k downloads • 2.75k likes
Yukang/LongAlpaca-16k-length • 6.28k rows • 23 downloads • 25 likes
• 51.8k rows • 31.6k downloads • 811 likes
• 343M rows • 752 downloads • 10 likes
NousResearch/json-mode-eval • 100 rows • 369 downloads • 42 likes
NousResearch/func-calling-eval-singleturn • 112 rows • 10 downloads • 7 likes
NousResearch/func-calling-eval-glaive • 100 rows • 7 downloads • 8 likes
legacy-datasets/wikipedia • 88k downloads • 619 likes
• 10.4B rows • 633k downloads • 547 likes
open-web-math/open-web-math • 6.32M rows • 18k downloads • 333 likes
codeparrot/github-code-clean • 11M rows • 20.3k downloads • 137 likes
HuggingFaceFW/fineweb-edu-score-2 • 13.9B rows • 30.3k downloads • 85 likes
HuggingFaceFW/fineweb-edu • 3.5B rows • 366k downloads • 1.03k likes
• 52k rows • 89.6k downloads • 943 likes
• 772k rows • 54 downloads • 26 likes
YeungNLP/WizardLM_evol_instruct_V2_143k • 143k rows • 6 downloads • 11 likes
• 2.94M rows • 24.5k downloads • 1.52k likes
WizardLMTeam/WizardLM_evol_instruct_V2_196k • 143k rows • 4.28k downloads • 247 likes
timdettmers/openassistant-guanaco • 10.4k rows • 13.8k downloads • 441 likes
garage-bAInd/Open-Platypus • 24.9k rows • 12.8k downloads • 416 likes
• 3.71M rows • 1.16M downloads • 666 likes
• 333 downloads • 225 likes
Salesforce/xlam-function-calling-60k • 60k rows • 9.93k downloads • 599 likes
HuggingFaceTB/smollm-corpus • 237M rows • 46.4k downloads • 449 likes
glaiveai/glaive-function-calling-v2 • 113k rows • 16.9k downloads • 499 likes
mlfoundations/dclm-baseline-1.0-parquet • 2.73B rows • 10.4k downloads • 36 likes
mlfoundations/dclm-baseline-1.0 • 143k downloads • 262 likes
ruslanmv/ai-medical-chatbot • 257k rows • 1.19k downloads • 247 likes
• 100k rows • 10.2k downloads • 264 likes
• 69.9k rows • 222k downloads • 390 likes
xzuyn/manythings-translations-alpaca • 6.33M rows • 13 downloads • 8 likes
• 21.9M rows • 4.24k downloads • 711 likes
• 1.75M rows • 269 downloads • 105 likes
mlabonne/open-perfectblend • 1.42M rows • 1.69k downloads • 72 likes
mlabonne/orca-agentinstruct-1M-v1-cleaned • 1.05M rows • 124 downloads • 67 likes
allenai/tulu-3-sft-mixture • 939k rows • 16.3k downloads • 235 likes
NovaSky-AI/Sky-T1_data_17k • 16.4k rows • 3.15k downloads • 186 likes
• 552M rows • 650 downloads • 2 likes
• 78.1M rows • 675 downloads • 6 likes
• 1.13M rows • 237 downloads • 11 likes
• 16.2M rows • 228 downloads • 1 like
• 172k rows • 75 downloads • 2 likes
• 62.3k rows • 48 downloads • 2 likes
• 72.1k rows • 24 downloads • 1 like
lianghsun/tw-instruct-500k • 500k rows • 140 downloads • 24 likes