aghoraguru/florence2_lora_vqa

Generated from Trainer

aghoraguru committed 25 days ago

Commit 01e5ee1 • 1 Parent(s): e551554

Model save

Files changed (1)

README.md +7 -1

@@ -15,6 +15,8 @@ should probably proofread and complete it, then remove this comment. -->
 # florence2_lora_vqa
 This model is a fine-tuned version of [microsoft/Florence-2-base-ft](https://huggingface.co/microsoft/Florence-2-base-ft) on an unknown dataset.
 ## Model description
@@ -39,11 +41,15 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3
 - mixed_precision_training: Native AMP
 ### Training results
 ### Framework versions

 # florence2_lora_vqa
 This model is a fine-tuned version of [microsoft/Florence-2-base-ft](https://huggingface.co/microsoft/Florence-2-base-ft) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 4.4793
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 20
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch   | Step | Validation Loss |
+|:-------------:|:-------:|:----:|:---------------:|
+| 5.1294        | 7.0423  | 500  | 4.7076          |
+| 4.8355        | 14.0845 | 1000 | 4.4793          |
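
The Epoch and Step columns in the table above can be cross-checked against the new `num_epochs: 20` setting. The sketch below is illustrative and not part of the commit. It assumes the Epoch column is the global step divided by the number of optimizer steps per epoch, which is how the Trainer usually reports it. The variable names are hypothetical.

```python
# (epoch, step, validation_loss) rows copied from the training-results table
rows = [
    (7.0423, 500, 4.7076),
    (14.0845, 1000, 4.4793),
]
num_epochs = 20  # value added by this commit

# Assumption: epoch = step / steps_per_epoch, so the ratio recovers steps per epoch.
steps_per_epoch = [round(step / epoch) for epoch, step, _ in rows]
assert len(set(steps_per_epoch)) == 1, "rows disagree on steps per epoch"

# Projected length of the full run under that assumption.
total_steps = steps_per_epoch[0] * num_epochs

# Change in validation loss between the two logged evaluations.
val_loss_drop = round(rows[0][2] - rows[1][2], 4)

print(f"steps per epoch: {steps_per_epoch[0]}")
print(f"projected total steps for {num_epochs} epochs: {total_steps}")
print(f"validation loss drop from step 500 to 1000: {val_loss_drop}")
```

Under that assumption, both rows imply the same number of steps per epoch. The reported evaluation loss of 4.4793 matches the step-1000 row, which suggests, but does not confirm, that it comes from the last logged evaluation rather than from the end of the 20-epoch run.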
 ### Framework versions