How did you go about training this model? Did you encounter problem with tokenizer during training?

by radna - opened Mar 18

Mar 18

•

@valoomba I saw your tokenizer config and it seems the tokenizer has changed compared to the original FuseO1 model, I'm experiencing loss to 0 during training, is the tokenizer setting the cause of this? Output is just gibberish also.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment