DeepSeek testing - a gaunernst Collection

gaunernst 's Collections

DeepSeek testing

Gemma 3 QAT INT4 (from GGUF)

Gemma 3 QAT INT4 (from Flax)

Mini BERT models

Face Recognition Models

Smallish LLM pre-training datasets

Llama2-compatible

Llama3-compatible

DeepSeek testing

updated Apr 10

A collection of MoE+MLA models, serving as testing proxies for DeepSeek-V3/R1

deepseek-ai/DeepSeek-V2-Lite-Chat

Text Generation • 16B • Updated Jun 25, 2024 • 103k • 125
gaunernst/DeepSeek-V2-Lite-Chat-FP8

16B • Updated Apr 7 • 852
TechxGenus/DeepSeek-V2-Lite-Chat-AWQ

Text Generation • 3B • Updated Jul 4, 2024 • 55 • 2
deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 786k • • 12.6k
meituan/DeepSeek-R1-Block-INT8

Text Generation • 685B • Updated Feb 27 • 1.54k • 45
meituan/DeepSeek-R1-Channel-INT8

Text Generation • Updated Feb 27 • 4.93k • 30
QuixiAI/DeepSeek-V3-AWQ

Text Generation • Updated Mar 29 • 1.02k • 35
ISTA-DASLab/DeepSeek-R1-GPTQ-4b-128g-experts

Text Generation • Updated Apr 8 • 11 • 3