ValueError: Missing parameters: lm_head.weight.

#2
by sheldonxxxx - opened

Thanks for the conversion.

I tried to run this model with mlx-lm, but encountered ValueError: Missing parameters: lm_head.weight.
With the same code, the 4B-4bit-DWQ works fine. Is something missing for this 8B model?

MLX Community org
edited Jun 22

Currently, mlx-lm doesn't support embedding models. I used my Swift version of MLX embeddings for the embedding model, which can be found at https://github.com/mzbac/mlx.embeddings. If you are working with Python, you may need to modify mlx-lm by loading the model with the lm_head removed and doing last-token pooling. You can have a look at the gist here: https://gist.github.com/mzbac/96d8427542496fd75ca67849b5cb6ce9
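For reference, last-token pooling just means taking the hidden state of each sequence's final non-padding token as the embedding. Here is a minimal sketch of that step, illustrated with NumPy (the mlx.core array API is broadly similar); the shapes and mask convention are assumptions, not taken from the gist:

```python
import numpy as np

def last_token_pool(hidden_states, attention_mask):
    """Pick the hidden state of each sequence's last real token.

    hidden_states: (batch, seq_len, dim) output of the encoding layers.
    attention_mask: (batch, seq_len), 1 for real tokens, 0 for padding.
    """
    last_idx = attention_mask.sum(axis=1) - 1        # (batch,) index of last real token
    batch_idx = np.arange(hidden_states.shape[0])
    return hidden_states[batch_idx, last_idx]        # (batch, dim)

# Toy check: batch of 2, seq_len 3, dim 2; sequence 0 has one padding token.
h = np.arange(12, dtype=np.float32).reshape(2, 3, 2)
mask = np.array([[1, 1, 0], [1, 1, 1]])
emb = last_token_pool(h, mask)  # rows taken at positions 1 and 2
```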

By the way, the root cause of the 8B model not working with mlx-lm is that it has tie_word_embeddings set to false. As a result, mlx-lm attempts to load the lm_head, which does not exist in the embedding model. An embedding model never needs the lm_head layer, since it uses the hidden states from the encoding layers, so it would be better to implement your own embedding model instead of using mlx-lm's default one. An easy workaround might be changing the config.json file to set tie_word_embeddings to true.
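The workaround is a one-line change to the model's config.json. A small sketch of flipping the flag programmatically (the directory here is a stand-in created for illustration; point cfg_path at your downloaded model folder instead):

```python
import json
import tempfile
from pathlib import Path

# Stand-in for a downloaded model directory (replace with your real path).
model_dir = Path(tempfile.mkdtemp())
cfg_path = model_dir / "config.json"
cfg_path.write_text(json.dumps({"tie_word_embeddings": False}))

# Flip the flag so mlx-lm reuses the embedding matrix instead of
# looking for a separate lm_head.weight.
cfg = json.loads(cfg_path.read_text())
cfg["tie_word_embeddings"] = True
cfg_path.write_text(json.dumps(cfg, indent=2))
```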

MLX Community org

Here's a clearer example of how to load Qwen3 embedding using MLX-LM: https://gist.github.com/mzbac/67d92c2cfe8bcf75579ac55144d1174f
