Final GLM-Z1-9B-0414-GGUF fixes!
#1, pinned, opened by danielhanchen
Hey guys, we reuploaded the quants with more fixes. Hopefully these are the final ones! Please use --jinja.
If you don't use --jinja, which applies the chat template, you will get gibberish!
Results should be much better, so let us know how it goes! To try it:
./llama.cpp/llama-cli -hf unsloth/GLM-Z1-9B-0414-GGUF:Q4_K_XL -ngl 99 --jinja
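Since forgetting --jinja is the easy mistake here, a small guard can help. This is only a sketch: `ensure_jinja` is a hypothetical helper name (not part of llama.cpp) that prints the arguments it was given and appends --jinja when it is missing.

```shell
# Hypothetical helper, not part of llama.cpp: print the given arguments,
# adding --jinja at the end if it is not already among them.
ensure_jinja() {
  for arg in "$@"; do
    if [ "$arg" = "--jinja" ]; then
      # Flag already present: print the arguments unchanged.
      printf '%s\n' "$*"
      return 0
    fi
  done
  # Flag missing: append it.
  printf '%s\n' "$* --jinja"
}

# Example use (assumes no argument contains spaces, since the output is word-split):
#   ./llama.cpp/llama-cli $(ensure_jinja -hf unsloth/GLM-Z1-9B-0414-GGUF:Q4_K_XL -ngl 99)
```

The helper only edits the argument list; it is still llama-cli with --jinja that applies the chat template.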
Thank you!