Update README.md
Browse files
README.md
CHANGED
@@ -143,8 +143,7 @@ Implementations of SacreBLEU, BERT Score, ROUGLE, METEOR, and SARI are from Hugg
|
|
143 |
|
144 |
## Results
|
145 |
|
146 |
-
We tested our model on the SAS test set (200 samples). We generate 10 lay summaries based on each sample's abstract. During generation, we used top-p sampling with
|
147 |
-
The mean performance is reported below.
|
148 |
|
149 |
|
150 |
| Metrics | SAS |
|
@@ -157,7 +156,7 @@ The mean performance is reported below.
|
|
157 |
| METEOR↑ | 43.75 |
|
158 |
| SARI↑ | 51.96 |
|
159 |
| ARI↓ | 17.04 |
|
160 |
-
Note: 1. Some generated texts are too short (less than 100 words) to calcualte meaningful ARI. We therefore concatenated adjecent five texts and compute ARI for the 400 longer texts (instead of original 2,000 texts). 2. BERT score, ROUGE, and METEOR multiplied by 100.
|
161 |
|
162 |
|
163 |
# Contact
|
|
|
143 |
|
144 |
## Results
|
145 |
|
146 |
+
We tested our model on the SAS test set (200 samples). We generate 10 lay summaries based on each sample's abstract. During generation, we used top-p sampling with p=0.9. The mean performance is reported below.
|
|
|
147 |
|
148 |
|
149 |
| Metrics | SAS |
|
|
|
156 |
| METEOR↑ | 43.75 |
|
157 |
| SARI↑ | 51.96 |
|
158 |
| ARI↓ | 17.04 |
|
159 |
+
Note: 1. Some generated texts are too short (less than 100 words) to calcualte meaningful ARI. We therefore concatenated adjecent five texts and compute ARI for the 400 longer texts (instead of original 2,000 texts). 2. BERT score, ROUGE, and METEOR are multiplied by 100.
|
160 |
|
161 |
|
162 |
# Contact
|