QuantTrio/Qwen3-235B-A22B-Thinking-2507-GPTQ-Int4-Int8Mix

Text Generation
Transformers
Safetensors
qwen3_moe
Qwen3
GPTQ
Int4-Int8Mix
Quantization fix
vLLM
conversational
4-bit precision
gptq

Upgrade to 1m context

#4 opened 11 days ago by freegheist

Help: Trying to load on 2x 6000 pro 96gb

#3 opened 21 days ago by Fernanda24 (3 comments)

Failed with v0.9.2 on 8x2080Ti 22GB

#2 opened 25 days ago by xydarcher (2 comments)

Unable to load with vllm, "model.safetensors.index.json" contains incorrect files.

#1 opened 28 days ago by ac101m (1 comment)