Final GLM-Z1-9B-0414-GGUF fixes!

#1
by danielhanchen - opened

Hey guys, we reuploaded the quants with more fixes. Hopefully these are the final fixes! Please use --jinja.

If you don't use --jinja, which applies the chat template, then you will get gibberish!

Results should be much better now, so let us know how it goes:

./llama.cpp/llama-cli -hf unsloth/GLM-Z1-9B-0414-GGUF:Q4_K_XL -ngl 99 --jinja
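
If you tend to forget the flag, a small shell wrapper along these lines could add it for you. This is only a sketch: `run_llama` and the `LLAMA_CLI` variable are made-up names for illustration (not part of llama.cpp), and the default path simply assumes the same `./llama.cpp/llama-cli` location as the command above.

```shell
# Hypothetical wrapper: run llama-cli and append --jinja if the caller left it out.
# LLAMA_CLI is an assumed override for the binary path, not a llama.cpp setting.
run_llama() {
  has_jinja=0
  for arg in "$@"; do
    if [ "$arg" = "--jinja" ]; then
      has_jinja=1
    fi
  done
  if [ "$has_jinja" -eq 0 ]; then
    # Add the flag at the end of the argument list.
    set -- "$@" --jinja
  fi
  "${LLAMA_CLI:-./llama.cpp/llama-cli}" "$@"
}
```

For example, `run_llama -hf unsloth/GLM-Z1-9B-0414-GGUF:Q4_K_XL -ngl 99` would then run the same command as above with --jinja appended.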

Thank you!

danielhanchen pinned discussion
