Update README.md
README.md
This is [trashpanda-org/QwQ-32B-Snowdrop-v0](https://huggingface.co/trashpanda-org/QwQ-32B-Snowdrop-v0) with the `embed_tokens` and `lm_head` tensors replaced with the correctly-sized ones from [Qwen/Qwen2.5-32B-Instruct](https://huggingface.co/Qwen/Qwen2.5-32B-Instruct).

(Why the instruct model and not QwQ? Because that's the tokenizer trashpanda was aiming for.)

At the time of posting there's an ongoing issue where the Qwen2.5 embedding tensors have dimension `152064` (matching the vocab size stated in the config), but the tokenizer and vocab actually included define fewer tokens (seemingly Qwen pre-initialized extra embedding space for future added tokens). Some LLM software (e.g. Axolotl, Mergekit) treats this as a mismatch: an automated check sees that the vocab size is smaller than the embedding size and resizes the embeddings to match, which breaks compatibility in some places.
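
For illustration only (this is not the replacement script below), here's a minimal sketch of how the mismatch shows up, assuming the standard `transformers` Auto classes; it only inspects the config and tokenizer:

```python
from transformers import AutoConfig, AutoTokenizer

model_id = "Qwen/Qwen2.5-32B-Instruct"

config = AutoConfig.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Embedding rows declared in the config vs. tokens the tokenizer actually defines.
print("config vocab_size:", config.vocab_size)  # 152064
print("tokenizer length: ", len(tokenizer))     # fewer than 152064

# Tools that react to this gap by calling something like
# model.resize_token_embeddings(len(tokenizer)) end up shrinking
# embed_tokens / lm_head, which is the breakage described above.
```

Keeping the full-size tensors from the instruct model, as this repo does, sidesteps that resize.
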
```python
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch