Finetuned model has smaller model.safetensors size

#8
by nam-withpi - opened

I fine-tuned this model for a sequence classification task but the model.safetensors file of the fine-tuned checkpoint is much smaller than the base model, 2.44GB (fine-tuned checkpoint) versus 3.02GB (based model).

Also when I eval the checkpoint model from the saved checkpoint, the checkpoint performance is different from what reported in the during the training loop. I suspect that somehow the saved model is missing module weights.

Please help with this issue. Much appreciated!

Screenshot 2025-03-25 at 11.14.54 PM.png

EuroBERT org

Hey @nam-withpi , could you share your training and saving code with us? We'll take a look at it 🙌

I fine-tuned this model for a sequence classification task but the model.safetensors file of the fine-tuned checkpoint is much smaller than the base model, 2.44GB (fine-tuned checkpoint) versus 3.02GB (based model).

Also when I eval the checkpoint model from the saved checkpoint, the checkpoint performance is different from what reported in the during the training loop. I suspect that somehow the saved model is missing module weights.

Please help with this issue. Much appreciated!

Screenshot 2025-03-25 at 11.14.54 PM.png

Have you tried for information retrieval?

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment