Kallia commited on
Commit
8586acf
·
verified ·
1 Parent(s): 000c109

Training complete

Browse files
Files changed (2) hide show
  1. README.md +14 -11
  2. model.safetensors +1 -1
README.md CHANGED
@@ -19,11 +19,11 @@ should probably proofread and complete it, then remove this comment. -->
19
 
20
  This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on an unknown dataset.
21
  It achieves the following results on the evaluation set:
22
- - Loss: 0.8158
23
- - Rouge1: 43.5332
24
- - Rouge2: 35.3228
25
- - Rougel: 41.6098
26
- - Rougelsum: 42.0283
27
 
28
  ## Model description
29
 
@@ -48,17 +48,20 @@ The following hyperparameters were used during training:
48
  - seed: 42
49
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
50
  - lr_scheduler_type: linear
51
- - num_epochs: 5
52
 
53
  ### Training results
54
 
55
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
56
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
57
- | 0.9339 | 1.0 | 567 | 0.8689 | 43.605 | 35.245 | 41.6921 | 42.0047 |
58
- | 0.8141 | 2.0 | 1134 | 0.8350 | 43.92 | 35.795 | 42.1339 | 42.4823 |
59
- | 0.7831 | 3.0 | 1701 | 0.8228 | 43.9384 | 35.7565 | 42.1371 | 42.4679 |
60
- | 0.7617 | 4.0 | 2268 | 0.8173 | 43.4454 | 35.2275 | 41.5476 | 41.9424 |
61
- | 0.7492 | 5.0 | 2835 | 0.8158 | 43.5332 | 35.3228 | 41.6098 | 42.0283 |
 
 
 
62
 
63
 
64
  ### Framework versions
 
19
 
20
  This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on an unknown dataset.
21
  It achieves the following results on the evaluation set:
22
+ - Loss: 0.8004
23
+ - Rouge1: 44.167
24
+ - Rouge2: 36.0987
25
+ - Rougel: 42.3071
26
+ - Rougelsum: 42.7469
27
 
28
  ## Model description
29
 
 
48
  - seed: 42
49
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
50
  - lr_scheduler_type: linear
51
+ - num_epochs: 8
52
 
53
  ### Training results
54
 
55
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
56
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
57
+ | 0.9333 | 1.0 | 567 | 0.8685 | 43.2839 | 35.0416 | 41.431 | 41.724 |
58
+ | 0.8108 | 2.0 | 1134 | 0.8320 | 43.8366 | 35.6114 | 42.0042 | 42.3429 |
59
+ | 0.7762 | 3.0 | 1701 | 0.8183 | 43.6494 | 35.4373 | 41.755 | 42.1571 |
60
+ | 0.7487 | 4.0 | 2268 | 0.8084 | 43.8891 | 35.6917 | 41.9799 | 42.3663 |
61
+ | 0.7289 | 5.0 | 2835 | 0.8036 | 43.7749 | 35.5598 | 41.851 | 42.2422 |
62
+ | 0.7166 | 6.0 | 3402 | 0.8018 | 44.1163 | 36.0261 | 42.335 | 42.6929 |
63
+ | 0.7014 | 7.0 | 3969 | 0.8015 | 44.0648 | 35.8877 | 42.1801 | 42.6115 |
64
+ | 0.7119 | 8.0 | 4536 | 0.8004 | 44.167 | 36.0987 | 42.3071 | 42.7469 |
65
 
66
 
67
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3b3c7ab2f852a6de010d0f7f8130ac14577623a84d21b41ae7166cf261c5f8fb
3
  size 242041896
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1ceaa294c074d4baf42eeb111f9d6e76412e6c14945b665042213b4f2ca605dc
3
  size 242041896