llama.cpp and ik_llama.cpp imatrix Quantizations of unsloth/phi-4-GGUF

The importance matrix (imatrix) and all quantizations were created from https://huggingface.co/unsloth/phi-4-GGUF/phi-4-F16.gguf

Imatrix calibration dataset from bartowski1182
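
For reference, a minimal sketch of the typical workflow, assuming a local llama.cpp build with the llama-imatrix and llama-quantize tools on the PATH; the file names calibration.txt and imatrix.dat are placeholders rather than the exact files used for this repo:

    import subprocess

    # Step 1: compute the importance matrix from the F16 model and a calibration text file.
    subprocess.run(
        ["llama-imatrix",
         "-m", "phi-4-F16.gguf",    # source model (F16 GGUF)
         "-f", "calibration.txt",   # calibration dataset (placeholder name)
         "-o", "imatrix.dat"],      # resulting importance matrix
        check=True,
    )

    # Step 2: quantize the F16 model to one of the listed types, guided by the imatrix.
    subprocess.run(
        ["llama-quantize",
         "--imatrix", "imatrix.dat",
         "phi-4-F16.gguf",          # input model
         "phi-4-IQ4_XS.gguf",       # output file
         "IQ4_XS"],                 # target quantization type
        check=True,
    )

The same two steps apply to each quantization type listed below, changing only the output name and target type.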

llama.cpp

phi-4-IQ2_S.gguf
phi-4-IQ3_XS.gguf
phi-4-Q4_K_M.gguf
phi-4-IQ4_XS.gguf
phi-4-IQ4_NL.gguf
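
A hedged usage sketch for the llama.cpp quants above, using the llama-cpp-python bindings; it assumes a recent llama-cpp-python built against a llama.cpp version that supports these IQ types, and the model path is a placeholder for wherever the file was downloaded:

    from llama_cpp import Llama

    # Load one of the llama.cpp-compatible quants from this repo.
    llm = Llama(
        model_path="phi-4-IQ4_XS.gguf",  # placeholder path to the downloaded file
        n_ctx=4096,                      # context window to allocate
    )

    # Simple completion call; chat templating is left out for brevity.
    out = llm("Explain what an importance matrix (imatrix) is used for.", max_tokens=128)
    print(out["choices"][0]["text"])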

ik_llama.cpp

phi-4-IQ4_KS.gguf
phi-4-IQ4_NL_R4.gguf
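
The IQ4_KS and IQ4_NL_R4 quant types come from the ik_llama.cpp fork and are not expected to load in mainline llama.cpp. A minimal run sketch, assuming an ik_llama.cpp build whose llama-cli binary is on the PATH (the fork generally keeps the upstream llama.cpp tool names and flags):

    import subprocess

    # Run an ik_llama.cpp-specific quant with the fork's CLI binary.
    subprocess.run(
        ["llama-cli",
         "-m", "phi-4-IQ4_KS.gguf",   # ik_llama.cpp-only quant type
         "-p", "Hello from phi-4.",   # prompt
         "-n", "64"],                 # number of tokens to generate
        check=True,
    )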

Credits

llama.cpp, ik_llama.cpp, bartowski, microsoft, unsloth, huggingface

Model details

Format: GGUF
Model size: 14.7B params
Architecture: llama
Base model: microsoft/phi-4
