meituan
/
DeepSeek-R1-Block-INT8

Text Generation
Transformers
Safetensors
deepseek_v3
conversational
custom_code
text-generation-inference
8-bit precision
blockwise_int8
Quantization type

#21 opened about 1 month ago by WYBJ

DeepSeek-V3-0324 int8 garbled

#20 opened about 1 month ago by zchflyer

4-bits

#19 opened about 2 months ago by zhnagchenchne

Weight output_partition_size = 576 is not divisible by weight quantization block_n = 128

#18 opened 2 months ago by yuwanpeng

Optimal `weight_block_size` for Intel AMX `amx_int8` `amx_tile`?

#17 opened 2 months ago by ubergarm (1 comment)

what about `ollama`?

#16 opened 2 months ago by ice6

Is there a specific recommended sglang image version? :)

#14 opened 2 months ago by wangkkk956 (1 comment)

After deploying with the latest sglang, I found that the responses when calling the interface were chaotic.

#13 opened 2 months ago by ShiningMaker (4 comments)