DeepSeek-V3-0324 int8 garbled
#20 opened about 1 month ago
by
zchflyer
4-bits
#19 opened about 2 months ago
by
zhnagchenchne
Weight output_partition_size = 576 is not divisible by weight quantization block_n = 128
#18 opened 2 months ago
by
yuwanpeng
Optimal `weight_block_size` for Intel AMX `amx_int8` `amx_tile`?
1
#17 opened 2 months ago
by
ubergarm
what about `ollama`?
#16 opened 2 months ago
by
ice6
是否有明确的sglang镜像版本推荐:)
1
#14 opened 2 months ago
by
wangkkk956