Broken model?

#4
by th1nhhdk - opened

I tried using LM Studio, but the output is "<pad>" repeated over and over.

MLX Community org
edited 17 days ago

Here's what I see...
(screenshot attached)

Yes, something is broken. The model needs an image in the context window to work properly.

(screenshot attached)


Ah, I misremembered, it's actually <pad>.

I am using LM Studio as well, and the output is "<pad>" repeated again and again, like in the previous images. The max context size is also only about 4k tokens instead of 128k, I think.

MLX Community org

Taking a look.

@pcuenq
Could you also confirm which base model was used: google/gemma-3-12b-pt or google/gemma-3-12b-it?
I am asking because the model card lists google/gemma-3-12b-it as the base model, but the link in the right menu points to google/gemma-3-12b-pt.

MLX Community org

I can confirm this report with gemma-3-27b-it-4bit.

MLX Community org
edited 16 days ago

For me too; it is indeed mlx-community/gemma-3-12b-it-4bit.
Also, I can confirm that adding an image to the context does make it work. Here's a screenshot of the 12B model describing it... impressive.

(screenshot attached)
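For anyone who wants to script the image-in-context workaround instead of using LM Studio, here is a minimal sketch using the mlx-vlm package. The load/generate signatures vary between mlx-vlm releases, and the prompt and image path are placeholders, so treat this as illustrative rather than official usage:

```python
# Minimal sketch of the image-in-context workaround using mlx-vlm.
# NOTE: argument names/order differ between mlx-vlm releases; adjust as needed.
from mlx_vlm import load, generate

# Load the quantized Gemma 3 model from the MLX community hub.
model, processor = load("mlx-community/gemma-3-12b-it-4bit")

# Passing an image along with the prompt is what avoids the "<pad>" loop
# described above; "photo.jpg" is a placeholder path.
output = generate(
    model,
    processor,
    "Describe this image.",
    image="photo.jpg",
)
print(output)
```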

I can confirm that gemma-3-4b-it-4bit has the same problem.

Same issue here for gemma-3-12b-it-4bit.

I can confirm that the model now works properly with this commit.

th1nhhdk changed discussion status to closed