Thanks!

#1
by ubergarm - opened

Hopefully I can take a look at these over the weekend as I'm doing some benchmarking, I'll follow along in the discussions.

One quick note, myself and some others have started adding ik_llama.cpp tag in the top of the README.md to help folks find your other iqN_k style quants e.g.

---
quantized_by: ubergarm
pipeline_tag: text-generation
base_model: Qwen/Qwen3-30B-A3B
license: apache-2.0
license_link: https://huggingface.co/Qwen/Qwen3-30B-A3B/blob/main/LICENSE
base_model_relation: quantized
tags:
- imatrix
- qwen3_moe
- conversational
- ik_llama.cpp
---

Then you can see all the available compatible quants like so: https://huggingface.co/models?other=ik_llama.cpp

Cheers!

Sign up or log in to comment