Smol Models - a BjornMelin Collection

BjornMelin 's Collections

Courses & Certifications

Single 4090 Laptop GPU

Mistral

Legendary VL Models

Google

Llama

Qwen

LLMs

Smol Models

updated May 5

My favorite smaller models under 10B parameters.

nvidia/Llama-3.1-Nemotron-Nano-8B-v1

Text Generation • 8B • Updated May 8 • 393k • • 188
deepseek-ai/DeepSeek-R1-Distill-Llama-8B

Text Generation • 8B • Updated Feb 24 • 1.21M • • 762
bartowski/DeepSeek-R1-Distill-Qwen-7B-GGUF

Text Generation • 8B • Updated Mar 7 • 15.1k • 98
Qwen/Qwen2.5-Coder-7B-Instruct

Text Generation • 8B • Updated Jan 12 • 273k • • 498
Qwen/Qwen2.5-7B-Instruct

Text Generation • 8B • Updated Jan 12 • 2.09M • • 712
VIDraft/Gemma-3-R1984-4B

Image-Text-to-Text • 4B • Updated Apr 10 • 2.2k • 20
meta-llama/Llama-3.1-8B-Instruct

Text Generation • 8B • Updated Sep 25, 2024 • 5.18M • • 4.22k
meta-llama/Llama-3.2-3B-Instruct

Text Generation • 3B • Updated Oct 24, 2024 • 1.4M • • 1.57k
openbmb/MiniCPM-o-2_6

Any-to-Any • 9B • Updated 13 days ago • 94.3k • 1.18k
microsoft/Phi-4-mini-reasoning

Text Generation • 4B • Updated May 1 • 22.9k • 180