Thanks!
#1
by
ubergarm
- opened
Hopefully I can take a look at these over the weekend as I'm doing some benchmarking, I'll follow along in the discussions.
One quick note, myself and some others have started adding ik_llama.cpp
tag in the top of the README.md to help folks find your other iqN_k
style quants e.g.
---
quantized_by: ubergarm
pipeline_tag: text-generation
base_model: Qwen/Qwen3-30B-A3B
license: apache-2.0
license_link: https://huggingface.co/Qwen/Qwen3-30B-A3B/blob/main/LICENSE
base_model_relation: quantized
tags:
- imatrix
- qwen3_moe
- conversational
- ik_llama.cpp
---
Then you can see all the available compatible quants like so: https://huggingface.co/models?other=ik_llama.cpp
Cheers!