fineweb-1B / README.md

Sample of ~1B tokens from FineWeb 15T, tokenized with a custom Llama 3.2 1B tokenizer. For personal use.
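
How the sample was drawn is not documented here. As a rough illustration only, one way to cut a token-budgeted sample from a document stream is sketched below. The names `sample_until_budget` and `toy_tokenize` are hypothetical, and the whitespace tokenizer is a stand-in assumption, not the custom tokenizer used for this dataset.

```python
from typing import Callable, Iterable, Iterator, List


def sample_until_budget(
    docs: Iterable[str],
    tokenize: Callable[[str], List[int]],
    token_budget: int,
) -> Iterator[List[int]]:
    """Yield tokenized documents until about `token_budget` tokens are emitted.

    Documents are kept whole, so the total can overshoot the budget by
    at most one document's worth of tokens.
    """
    total = 0
    for text in docs:
        if total >= token_budget:
            break
        ids = tokenize(text)
        total += len(ids)
        yield ids


def toy_tokenize(text: str) -> List[int]:
    # Stand-in tokenizer (assumption): one id per whitespace-separated word,
    # using the word length as the id. Swap in a real tokenizer in practice.
    return [len(word) for word in text.split()]


if __name__ == "__main__":
    corpus = ["a bb ccc", "dddd eeeee", "f g h i"]
    sampled = list(sample_until_budget(corpus, toy_tokenize, token_budget=4))
    print(sum(len(ids) for ids in sampled))
```

Because the function only needs an iterable of strings and a callable, it could in principle be fed a streamed corpus and a real tokenizer's encode function, with the budget set to roughly one billion.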