PatrickHaller/hgrn2_pile_100m_distill_babylm Text Generation • Updated Dec 17, 2024 • 6.23k • 1
Running • 2.76k • The Ultra-Scale Playbook: The ultimate guide to training LLMs on large GPU clusters