BjornMelin's Collections

Smol Models

updated May 5

My favorite smaller models under 10B parameters.

  • nvidia/Llama-3.1-Nemotron-Nano-8B-v1

    Text Generation • 8B • Updated May 8 • 393k • 188

  • deepseek-ai/DeepSeek-R1-Distill-Llama-8B

    Text Generation • 8B • Updated Feb 24 • 1.21M • 762

  • bartowski/DeepSeek-R1-Distill-Qwen-7B-GGUF

    Text Generation • 8B • Updated Mar 7 • 15.1k • 98

  • Qwen/Qwen2.5-Coder-7B-Instruct

    Text Generation • 8B • Updated Jan 12 • 273k • 498

  • Qwen/Qwen2.5-7B-Instruct

    Text Generation • 8B • Updated Jan 12 • 2.09M • 712

  • VIDraft/Gemma-3-R1984-4B

    Image-Text-to-Text • 4B • Updated Apr 10 • 2.2k • 20

  • meta-llama/Llama-3.1-8B-Instruct

    Text Generation • 8B • Updated Sep 25, 2024 • 5.18M • 4.22k

  • meta-llama/Llama-3.2-3B-Instruct

    Text Generation • 3B • Updated Oct 24, 2024 • 1.4M • 1.57k

  • openbmb/MiniCPM-o-2_6

    Any-to-Any • 9B • Updated 13 days ago • 94.3k • 1.18k

  • microsoft/Phi-4-mini-reasoning

    Text Generation • 4B • Updated May 1 • 22.9k • 180