gaunernst's Collections
DeepSeek testing

updated Apr 10

A collection of MoE + MLA (Mixture-of-Experts with Multi-head Latent Attention) models that serve as testing proxies for DeepSeek-V3/R1.

  • deepseek-ai/DeepSeek-V2-Lite-Chat

    Text Generation • 16B • Updated Jun 25, 2024 • 72.1k • 125

  • gaunernst/DeepSeek-V2-Lite-Chat-FP8

    16B • Updated Apr 7 • 1.49k

  • TechxGenus/DeepSeek-V2-Lite-Chat-AWQ

    Text Generation • 3B • Updated Jul 4, 2024 • 830 • 2

  • deepseek-ai/DeepSeek-R1

    Text Generation • 685B • Updated Mar 27 • 833k • 12.5k

  • meituan/DeepSeek-R1-Block-INT8

    Text Generation • 685B • Updated Feb 27 • 982 • 45

  • meituan/DeepSeek-R1-Channel-INT8

    Text Generation • Updated Feb 27 • 6.21k • 28

  • QuixiAI/DeepSeek-V3-AWQ

    Text Generation • Updated Mar 29 • 1.64k • 34

  • ISTA-DASLab/DeepSeek-R1-GPTQ-4b-128g-experts

    Text Generation • Updated Apr 8 • 6 • 3