DeepSeek-V3 Pruning and Quantization
Collection
11 items
•
Updated
Original model: tflsxyy/DeepSeek-V3-0324-MoE-Pruner-E160-bf16
Less activated experts since pruning
"num_experts_per_tok": 6,
"topk_group": 2,
Quants made based on bartowski1182-llama.cpp
4-bit