Report

#1
by olegshulyakov - opened
MLX Community org

It looks like the model is not quantized to 4 bit.

Sign up or log in to comment