IQ4_NL is generating gibberish on llama.cpp

#2 · opened by netroy

With IQ4_NL, the latest llama.cpp generates complete gibberish on CUDA, on Vulkan, and on CPU-only as well.

[screenshot: gibberish output from llama-cli]
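
A minimal sketch of how the GPU-offloaded and CPU-only cases can be compared with the same quant (the prompt and token count are placeholders, assuming a standard llama.cpp build):

```sh
# Hypothetical reproduction sketch: compare GPU-offloaded vs CPU-only output.
# Prompt and token count are illustrative placeholders, not from the original report.

# Full GPU offload (CUDA or Vulkan, depending on how llama.cpp was built)
./llama.cpp/llama-cli -hf unsloth/Qwen3-30B-A3B-Thinking-2507-GGUF:IQ4_NL \
  -ngl 99 --jinja -p "Explain what IQ4_NL quantization is." -n 128

# CPU-only: -ngl 0 keeps all layers on the CPU, ruling out GPU backend bugs
./llama.cpp/llama-cli -hf unsloth/Qwen3-30B-A3B-Thinking-2507-GGUF:IQ4_NL \
  -ngl 0 --jinja -p "Explain what IQ4_NL quantization is." -n 128
```

If only the GPU runs produce gibberish, that points at a backend kernel issue; if the CPU-only run is also broken, corrupted weights are more likely.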

Unsloth AI org

Does this happen with Q8 as well for you? I tried IQ4_NL and it works fine for me.

Unsloth AI org

@netroy I tried on CUDA again via `./llama.cpp/llama-cli -hf unsloth/Qwen3-30B-A3B-Thinking-2507-GGUF:IQ4_NL -ngl 99 --jinja` and it works fine - see the screenshot below:

[screenshot: coherent output from the IQ4_NL quant]

Please try redownloading the model weights as well.
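
A minimal sketch of one way to re-fetch just the IQ4_NL files with `huggingface-cli` (the `--include` pattern and local directory are assumptions; adjust them to the actual file names in the repo):

```sh
# Hypothetical redownload sketch: fetch only the IQ4_NL shards again.
# The filename pattern and target directory are assumptions.
huggingface-cli download unsloth/Qwen3-30B-A3B-Thinking-2507-GGUF \
  --include "*IQ4_NL*" \
  --local-dir ./Qwen3-30B-A3B-Thinking-2507-GGUF
```

Re-downloading rules out a truncated or corrupted GGUF file as the cause of the gibberish.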
