Single Shuffled Data Data shuffled only at the document-level babylm-seqlen/train_100M_128_single_shuffle Viewer • Updated Apr 8 • 1.28M • 5 babylm-seqlen/train_100M_1024_single_shuffle Viewer • Updated Apr 8 • 160k • 4 babylm-seqlen/train_100M_64_single_shuffle Viewer • Updated Apr 8 • 2.56M • 6 babylm-seqlen/train_100M_256_single_shuffle Viewer • Updated Apr 8 • 639k • 4
Double Shuffled Data Data shuffled at both the document-level, and again at the tokenized level babylm-seqlen/train_100M_256 Viewer • Updated Apr 7 • 639k • 2 babylm-seqlen/train_100M_1024 Viewer • Updated Apr 7 • 160k • 4 babylm-seqlen/train_100M_16384 Viewer • Updated Apr 7 • 9.86k • 5 babylm-seqlen/train_100M_4096 Viewer • Updated Apr 7 • 39.8k • 8
Single Shuffled Data Data shuffled only at the document-level babylm-seqlen/train_100M_128_single_shuffle Viewer • Updated Apr 8 • 1.28M • 5 babylm-seqlen/train_100M_1024_single_shuffle Viewer • Updated Apr 8 • 160k • 4 babylm-seqlen/train_100M_64_single_shuffle Viewer • Updated Apr 8 • 2.56M • 6 babylm-seqlen/train_100M_256_single_shuffle Viewer • Updated Apr 8 • 639k • 4
Double Shuffled Data Data shuffled at both the document-level, and again at the tokenized level babylm-seqlen/train_100M_256 Viewer • Updated Apr 7 • 639k • 2 babylm-seqlen/train_100M_1024 Viewer • Updated Apr 7 • 160k • 4 babylm-seqlen/train_100M_16384 Viewer • Updated Apr 7 • 9.86k • 5 babylm-seqlen/train_100M_4096 Viewer • Updated Apr 7 • 39.8k • 8