PatrickHaller/hgrn2_pile_100m_distill_babylm • Text Generation • Updated Dec 17, 2024 • 6.97k • 1
Running • 2.47k • The Ultra-Scale Playbook • The ultimate guide to training LLM on large GPU Clusters