mradermacher/QLiz-70B-GGUF

About

This model seems badly borked.

If you are unsure how to use GGUF files, refer to one of TheBloke's READMEs for more details, including on how to concatenate multi-part files.

(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)

Link	Type	Size/GB	Notes
GGUF	Q2_K	25.9
GGUF	IQ3_XS	28.4
GGUF	IQ3_S	30.3	beats Q3_K*
GGUF	Q3_K_S	30.3
GGUF	IQ3_M	31.0
GGUF	Q3_K_M	33.7	lower quality
GGUF	Q3_K_L	36.6
GGUF	IQ4_XS	37.3
GGUF	Q4_K_S	39.7	fast, recommended
GGUF	Q4_K_M	41.8	fast, recommended
GGUF	Q5_K_S	47.9
GGUF	Q5_K_M	49.2
PART 1 PART 2	Q6_K	57.0	very good quality
PART 1 PART 2	Q8_0	73.6	fast, best quality

Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):

See https://huggingface.co/mradermacher/model_requests for some answers to questions you might have and/or if you want some other model quantized.

I thank my company, nethype GmbH, for letting me use its servers and providing upgrades to my workstation to enable this work in my free time.