Finetuned model has smaller model.safetensors size
I fine-tuned this model for a sequence classification task, but the model.safetensors file of the fine-tuned checkpoint is much smaller than that of the base model: 2.44GB (fine-tuned checkpoint) versus 3.02GB (base model).
Also, when I evaluate the model loaded from the saved checkpoint, its performance differs from what was reported during the training loop. I suspect that the saved model is somehow missing some module weights.
Please help with this issue. Much appreciated!
Hey @nam-withpi , could you share your training and saving code with us? We'll take a look at it 🙌
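In the meantime, one way to check whether weights are actually missing is to compare the tensor names and dtypes stored in the two model.safetensors files. Below is a minimal sketch that reads only the file headers with the Python standard library, relying on the safetensors layout of an 8-byte little-endian header length followed by a JSON header. The function names (`read_safetensors_header`, `summarize`, `compare_checkpoints`) are made up for illustration and are not part of any library. A size gap does not necessarily mean lost weights: it could also come from a different head (for example a classification head in place of a language-modeling head) or from saving in a different dtype, and this comparison should show which of those applies.

```python
import json
import struct
from collections import defaultdict


def read_safetensors_header(path):
    """Return the JSON header of a .safetensors file without loading tensor data."""
    with open(path, "rb") as f:
        # First 8 bytes: little-endian unsigned 64-bit length of the JSON header.
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len).decode("utf-8"))
    # Optional free-form metadata entry; it does not describe a tensor.
    header.pop("__metadata__", None)
    return header


def summarize(header):
    """Return the total number of stored bytes per dtype."""
    totals = defaultdict(int)
    for info in header.values():
        start, end = info["data_offsets"]
        totals[info["dtype"]] += end - start
    return dict(totals)


def compare_checkpoints(base_path, finetuned_path):
    """Report tensors present in only one file, dtype changes, and bytes per dtype."""
    base = read_safetensors_header(base_path)
    tuned = read_safetensors_header(finetuned_path)
    shared = set(base) & set(tuned)
    return {
        "only_in_base": sorted(set(base) - set(tuned)),
        "only_in_finetuned": sorted(set(tuned) - set(base)),
        "dtype_changed": sorted(
            name for name in shared if base[name]["dtype"] != tuned[name]["dtype"]
        ),
        "base_bytes_by_dtype": summarize(base),
        "finetuned_bytes_by_dtype": summarize(tuned),
    }
```

Calling `compare_checkpoints("base/model.safetensors", "checkpoint/model.safetensors")` (both paths are placeholders) returns a dictionary you can print. Entries under `only_in_base` other than the original output head would point to weights that were not saved, while a non-empty `dtype_changed` list would point to a precision difference instead.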
Have you tried it for information retrieval?