Why is the model size of safetensor 5.23B parameters?
#6 opened 3 days ago
by
oilbread
Can you share the GPTQ quantization code?
#5 opened about 2 months ago
by
qwertist
Produce gibberish with dtype=auto
#4 opened about 2 months ago
by
divisingh
QAT version
๐ฅ
1
#3 opened 2 months ago
by
Delnith
vLLM on 24gb gpu
๐
2
#2 opened 4 months ago
by
roadtoagi
