Running • 2.75k • The Ultra-Scale Playbook • The ultimate guide to training LLMs on large GPU clusters
meta-llama/Llama-3.1-8B-Instruct • Text Generation • 8B • Updated Sep 25, 2024 • 5.04M • 4.24k
mistralai/Mistral-Nemo-Instruct-2407 • Text Generation • 12B • Updated Nov 6, 2024 • 94.4k • 1.56k
mistralai/Mistral-7B-Instruct-v0.3 • Text Generation • 7B • Updated Aug 21, 2024 • 1.12M • 1.85k
meta-llama/Meta-Llama-3-8B-Instruct • Text Generation • 8B • Updated 17 days ago • 1.44M • 4.04k