Commit
·
cfe0071
1
Parent(s):
d01fd7e
Model save
Browse files
README.md
CHANGED
@@ -17,12 +17,12 @@ should probably proofread and complete it, then remove this comment. -->
|
|
17 |
|
18 |
This model is a fine-tuned version of [google/flan-t5-large](https://huggingface.co/google/flan-t5-large) on an unknown dataset.
|
19 |
It achieves the following results on the evaluation set:
|
|
|
20 |
- Loss: 3.4156
|
21 |
- Rouge1: 20.6216
|
22 |
- Rouge2: 4.1953
|
23 |
- Rougel: 16.091
|
24 |
- Rougelsum: 17.4067
|
25 |
-
- Gen Len: 509.77
|
26 |
|
27 |
## Model description
|
28 |
|
@@ -53,13 +53,13 @@ The following hyperparameters were used during training:
|
|
53 |
|
54 |
### Training results
|
55 |
|
56 |
-
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
|
57 |
-
|
58 |
-
| No log | 1.0 | 218 | 3.5702 | 16.525 | 3.2484 | 14.5949 | 13.3919 |
|
59 |
-
| No log | 2.0 | 436 | 3.5013 | 17.6208 | 3.7153 | 14.7229 | 14.9556 |
|
60 |
-
| No log | 3.0 | 654 | 3.4592 | 19.4782 | 3.9388 | 15.7853 | 16.3468 |
|
61 |
-
| No log | 4.0 | 872 | 3.4337 | 20.8148 | 4.287 | 16.3887 | 17.3557 |
|
62 |
-
| No log | 5.0 | 1090 | 3.4156 | 20.6216 | 4.1953 | 16.091 | 17.4067 |
|
63 |
|
64 |
|
65 |
### Framework versions
|
|
|
17 |
|
18 |
This model is a fine-tuned version of [google/flan-t5-large](https://huggingface.co/google/flan-t5-large) on an unknown dataset.
|
19 |
It achieves the following results on the evaluation set:
|
20 |
+
- Gen Len: 509.77
|
21 |
- Loss: 3.4156
|
22 |
- Rouge1: 20.6216
|
23 |
- Rouge2: 4.1953
|
24 |
- Rougel: 16.091
|
25 |
- Rougelsum: 17.4067
|
|
|
26 |
|
27 |
## Model description
|
28 |
|
|
|
53 |
|
54 |
### Training results
|
55 |
|
56 |
+
| Training Loss | Epoch | Step | Gen Len | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
|
57 |
+
|:-------------:|:-----:|:----:|:--------:|:---------------:|:-------:|:------:|:-------:|:---------:|
|
58 |
+
| No log | 1.0 | 218 | 508.68 | 3.5702 | 16.525 | 3.2484 | 14.5949 | 13.3919 |
|
59 |
+
| No log | 2.0 | 436 | 509.62 | 3.5013 | 17.6208 | 3.7153 | 14.7229 | 14.9556 |
|
60 |
+
| No log | 3.0 | 654 | 509.9767 | 3.4592 | 19.4782 | 3.9388 | 15.7853 | 16.3468 |
|
61 |
+
| No log | 4.0 | 872 | 509.8233 | 3.4337 | 20.8148 | 4.287 | 16.3887 | 17.3557 |
|
62 |
+
| No log | 5.0 | 1090 | 509.77 | 3.4156 | 20.6216 | 4.1953 | 16.091 | 17.4067 |
|
63 |
|
64 |
|
65 |
### Framework versions
|
generation_config.json
CHANGED
@@ -1,7 +1,7 @@
|
|
1 |
{
|
|
|
2 |
"decoder_start_token_id": 0,
|
3 |
"eos_token_id": 1,
|
4 |
"pad_token_id": 0,
|
5 |
-
"transformers_version": "4.35.2"
|
6 |
-
"use_cache": false
|
7 |
}
|
|
|
1 |
{
|
2 |
+
"_from_model_config": true,
|
3 |
"decoder_start_token_id": 0,
|
4 |
"eos_token_id": 1,
|
5 |
"pad_token_id": 0,
|
6 |
+
"transformers_version": "4.35.2"
|
|
|
7 |
}
|
logs/events.out.tfevents.1702371753.ubuntu138.56717.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:2c58af3c8bb122a9f56ca83e576e9563080ac54e883691213c2261bd720f0c5d
|
3 |
+
size 4888
|