Why is the model size of safetensor 5.23B parameters?
#6 opened about 1 month ago by oilbread
Can you share the GPTQ quantization code?
#5 opened 3 months ago by qwertist
Produces gibberish with dtype=auto
#4 opened 3 months ago by divisingh
QAT version
#3 opened 4 months ago by Delnith
vLLM on 24 GB GPU
#2 opened 5 months ago by roadtoagi
