Running • 2.75k • The Ultra-Scale Playbook • The ultimate guide to training LLMs on large GPU clusters
meta-llama/Llama-3.3-70B-Instruct • Text Generation • 71B • Updated Dec 21, 2024 • 692k • 2.42k
meta-llama/Llama-3.2-3B-Instruct • Text Generation • 3B • Updated Oct 24, 2024 • 1.37M • 1.57k