DeepSeek-V3-0324 int8 garbled

#20
by zchflyer - opened

Use the following command to convert DeepSeek-V3-0324-bf16 to int8.
Using sglang for serve, the response is garbled.

python3 bf16_cast_block_int8.py
--input-bf16-hf-path /path/to/deepseek-v3-0324-bf16-weights/
--output-int8-hf-path /path/to/deepseek-v3-0324-int8-weight/

Sign up or log in to comment