DeepSeek-V3 Pruning and Quantization
Collection
11 items
•
Updated
Original model: tflsxyy/DeepSeek-V3-0324-MoE-Pruner-E192-bf16.
All quants made based on bartowski1182-llama.cpp.
All quants made using imatrix option based on tflsxyy/DeepSeek-V3-0324-MoE-Pruner-imatrix.
1-bit
2-bit