Add essential_web_500k_tokens.txt - Tokenized version (one token per line) of 500K chars from Essential-Web
Upload tokenized data files from Essential-Web dataset processing.
File: essential_web_500k_tokens.txt
Description: Tokenized version (one token per line) of 500K chars from Essential-Web
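A minimal sketch of how a one-token-per-line file such as this one might be loaded. The commit does not say which tokenizer was used or whether lines hold token strings or IDs, so this treats each line as an opaque string. The helper names `parse_token_lines` and `read_tokens` are hypothetical, and UTF-8 encoding is an assumption.

```python
from collections import Counter
from typing import Iterable, List


def parse_token_lines(lines: Iterable[str]) -> List[str]:
    """Turn an iterable of text lines into a list of tokens, one per line."""
    # Strip only the line terminator, so spaces inside a token survive
    # and a blank line is kept as an empty-string token.
    return [line.rstrip("\r\n") for line in lines]


def read_tokens(path: str) -> List[str]:
    """Read a one-token-per-line file (assumed UTF-8) into a list of tokens."""
    with open(path, encoding="utf-8") as fh:
        return parse_token_lines(fh)


# In-memory stand-in for the file contents; the real data is not shown here.
sample_lines = ["Essential\n", "-\n", "Web\n", "-\n"]
tokens = parse_token_lines(sample_lines)
counts = Counter(tokens)  # token frequency table
```

For the uploaded file, `read_tokens("essential_web_500k_tokens.txt")` would be the corresponding call, assuming the file sits in the working directory.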
essential_web_500k_tokens.txt
ADDED
The diff for this file is too large to render.