mistralai/Mistral-Small-3.1-24B-Instruct-2503 Image-Text-to-Text ā¢ Updated 7 days ago ā¢ 141k ā¢ ā¢ 1.12k
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. ā¢ 46 items ā¢ Updated Feb 26 ā¢ 587
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper ā¢ 2502.02737 ā¢ Published Feb 4 ā¢ 223