Running 115 115 TxT360: Trillion Extracted Text 📖 Create a large, deduplicated dataset for LLM pre-training
view post Post 4188 Please check the Open Source AI Network: we mapped the top 500 HF usersbased on their followers' profiles.The map can be found here: bunkalab/mapping_the_OS_community 1 reply · 🔥 14 14 🤯 2 2 + Reply
bunkalab/Phi-3-mini-128k-instruct-LinearBunkaScore-4.6k-DPO Text Generation • 4B • Updated May 30, 2024 • 19 • 2