flan-t5-large_question_answering_finetuining
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [google/flan-t5-large](https://huggingface.co/google/flan-t5-large) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss:
+- Loss: 0.5861
 
 ## Model description
 
@@ -35,19 +35,27 @@ More information needed
 
 The following hyperparameters were used during training:
 - learning_rate: 0.0003
-- train_batch_size:
-- eval_batch_size:
+- train_batch_size: 4
+- eval_batch_size: 4
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs:
+- num_epochs: 10
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-
-
+| 4.3031        | 1.0   | 79   | 0.5110          |
+| 0.4051        | 2.0   | 158  | 0.4330          |
+| 0.2949        | 3.0   | 237  | 0.4171          |
+| 0.2191        | 4.0   | 316  | 0.4090          |
+| 0.165         | 5.0   | 395  | 0.4273          |
+| 0.1199        | 6.0   | 474  | 0.4527          |
+| 0.0871        | 7.0   | 553  | 0.4851          |
+| 0.064         | 8.0   | 632  | 0.5375          |
+| 0.0497        | 9.0   | 711  | 0.5662          |
+| 0.044         | 10.0  | 790  | 0.5861          |
 
 
 ### Framework versions
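The card lists the hyperparameters but not the training code. Below is a minimal sketch of how those values would map onto the Hugging Face `Seq2SeqTrainer`, assuming that is what produced this card and the `runs/` TensorBoard logs; the dataset (the card only says "None dataset") and the output directory are placeholders.

```python
from transformers import (
    AutoModelForSeq2SeqLM,
    AutoTokenizer,
    Seq2SeqTrainer,
    Seq2SeqTrainingArguments,
)

tokenizer = AutoTokenizer.from_pretrained("google/flan-t5-large")
model = AutoModelForSeq2SeqLM.from_pretrained("google/flan-t5-large")

# Placeholders: the card does not name the QA dataset, so real tokenized
# train/eval datasets would have to be supplied here.
train_dataset = None
eval_dataset = None

args = Seq2SeqTrainingArguments(
    output_dir="flan-t5-large_question_answering_finetuining",
    learning_rate=3e-4,             # learning_rate: 0.0003
    per_device_train_batch_size=4,  # train_batch_size: 4
    per_device_eval_batch_size=4,   # eval_batch_size: 4
    num_train_epochs=10,            # num_epochs: 10
    seed=42,                        # seed: 42
    lr_scheduler_type="linear",     # lr_scheduler_type: linear
    evaluation_strategy="epoch",    # the table reports one validation loss per epoch
    logging_dir="runs",
    report_to="tensorboard",
)

# Adam with betas=(0.9, 0.999) and epsilon=1e-08 is the Trainer's default
# AdamW configuration, so no explicit optimizer setup is needed.
trainer = Seq2SeqTrainer(
    model=model,
    args=args,
    train_dataset=train_dataset,
    eval_dataset=eval_dataset,
    tokenizer=tokenizer,
)
# trainer.train()  # uncomment once real datasets are provided
```

Per the results table, validation loss bottoms out at epoch 4 (0.4090) and rises through epoch 10, so the 0.5861 quoted at the top of the card is the final-epoch value rather than the best one.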
emissions.csv CHANGED
@@ -7,3 +7,4 @@ timestamp,experiment_id,project_name,duration,emissions,energy_consumed,country_
 2024-02-17T15:19:25,60279b1c-2b5f-46ac-a314-3fd14f7593d4,codecarbon,22.601327419281006,0.0014373871409741098,0.002688287132063059,Canada,CAN,,N,,
 2024-02-17T15:29:58,0d75896e-078a-466e-8050-ea2e72a83c48,codecarbon,44.81913948059082,0.003049913346454106,0.005704129784842451,Canada,CAN,,N,,
 2024-02-17T15:42:11,5ecb3722-5983-4772-b503-aa34e1e20f3f,codecarbon,67.49829816818237,0.0038415553250788484,0.00718470581315977,Canada,CAN,,N,,
+2024-02-17T15:51:16,2689fbc8-0b16-4e86-8e78-73a6971f21aa,codecarbon,452.72942876815796,0.03205908639002386,0.05995881481838818,Canada,CAN,,N,,
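emissions.csv is a CodeCarbon log: each row is one tracked run, with duration in seconds, estimated emissions in kg CO2eq, and energy consumed in kWh. The newly added row (about 7.5 minutes) is presumably the full fine-tuning run. A minimal sketch of how such rows are produced, assuming default tracker settings:

```python
from codecarbon import EmissionsTracker

# Wrap the training run in a tracker; by default one row per run is appended
# to emissions.csv in the chosen output directory.
tracker = EmissionsTracker(project_name="codecarbon", output_dir=".")
tracker.start()
try:
    pass  # training loop goes here, e.g. trainer.train()
finally:
    emissions_kg = tracker.stop()  # returns estimated kg CO2eq and writes the CSV row
    print(f"Estimated emissions: {emissions_kg:.4f} kg CO2eq")
```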
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:4d60fc4ea19d74aff25b16b66fa70252222dc9962edef1954646f8ddbcd86c8c
 size 3132668808
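The entry above is a Git LFS pointer, not the weights themselves: `oid` is the SHA-256 of the actual model.safetensors blob and `size` its byte count (about 3.1 GB, roughly what ~780M fp32 parameters come to). A small sketch for checking a downloaded copy against the pointer; the local path is an assumption.

```python
import hashlib

# Hash the downloaded weights in 1 MiB chunks and compare against the oid
# recorded in the LFS pointer. "model.safetensors" is assumed to be local.
EXPECTED_OID = "4d60fc4ea19d74aff25b16b66fa70252222dc9962edef1954646f8ddbcd86c8c"

digest = hashlib.sha256()
with open("model.safetensors", "rb") as f:
    for chunk in iter(lambda: f.read(1 << 20), b""):
        digest.update(chunk)

assert digest.hexdigest() == EXPECTED_OID, "checksum mismatch with the LFS pointer"
print("model.safetensors matches the pointer oid")
```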
runs/Feb17_15-43-37_c0c8f2aaa2e2/events.out.tfevents.1708184621.c0c8f2aaa2e2.129401.0 CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:f5a5849488d40b29bf5cebddea01e1dd36da2b37fe488f51c1de53951c812377
+size 9394
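This file is the TensorBoard event log written during training (also stored via LFS; at 9394 bytes it holds only scalar summaries). A sketch for reading the logged curves back out, assuming the usual Hugging Face Trainer tag names such as `eval/loss`:

```python
from tensorboard.backend.event_processing.event_accumulator import EventAccumulator

# Point the accumulator at the run directory (or directly at the event file)
# and load whatever scalar tags were logged.
ea = EventAccumulator("runs/Feb17_15-43-37_c0c8f2aaa2e2")
ea.Reload()

print(ea.Tags()["scalars"])            # e.g. train/loss, eval/loss, train/learning_rate
for event in ea.Scalars("eval/loss"):  # assumed tag name; check Tags() first
    print(event.step, event.value)
```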