How about deepseek v3 model?
#15
by
JohnnyBoyzzz
- opened
Is your team interested in switching?
You should be able to do it yourself with the provided command on this model card:
python3 bf16_cast_block_int8.py \
--input-bf16-hf-path /path/to/bf16-weights/ \
--output-int8-hf-path /path/to/save-int8-weight/