pinned
Running
874
The Ultra-Scale Playbook
🌌
The ultimate guide to training LLM on large GPU Clusters
Large scale distributed AI model training, model parallelisation, low-level GPU acceleration, make GPUs go brrrrr
The Nanotron team focus on sharing open knowledge and developping open-source libraries for efficient distributed training of large-scale AI models.
Some of its contributions are: