nvidia/Llama-3_3-Nemotron-Super-49B-v1_5 Text Generation • 50B • Updated 19 days ago • 18.2k • 191
lightonai/Reason-ModernColBERT Sentence Similarity • 0.1B • Updated about 4 hours ago • 3.09k • 204
nvidia/Llama-3.1-Nemotron-8B-UltraLong-4M-Instruct Text Generation • 8B • Updated Apr 17 • 5.46k • 119
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B Text Generation • 8B • Updated Feb 24 • 1.48M • 713
Running • 3.17k • The Ultra-Scale Playbook • The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B Text Generation • 2B • Updated Feb 24 • 583k • 1.33k