Quantization

#12
by Zelyanoth - opened

Hello, has anyone had issues trying to load the model in 4-bit or 8-bit?

Google org

@Zelyanoth, please have a look at this similar issue, where a user successfully quantized Gemma 3 27B to Q8 using llama.cpp. I hope this helps!
