lewtun HF Staff commited on
Commit
075c51b
·
1 Parent(s): f52be04

Add evaluation results on the plain_text config and test split of launch/gov_report

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the plain_text config and test split of the [launch/gov_report](https://huggingface.co/datasets/launch/gov_report) dataset by @nonchalant-nagavalli, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-launch__gov_report-plain_text-2fa37c-16136228).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=launch/gov_report).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=launch/gov_report).

Files changed (1) hide show
  1. README.md +33 -0
README.md CHANGED
@@ -146,6 +146,39 @@ model-index:
146
  type: gen_len
147
  value: 139.0398
148
  verified: true
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
149
  ---
150
 
151
  # long-t5-tglobal-large-pubmed-3k-booksum-16384-WIP
 
146
  type: gen_len
147
  value: 139.0398
148
  verified: true
149
+ - task:
150
+ type: summarization
151
+ name: Summarization
152
+ dataset:
153
+ name: launch/gov_report
154
+ type: launch/gov_report
155
+ config: plain_text
156
+ split: test
157
+ metrics:
158
+ - name: ROUGE-1
159
+ type: rouge
160
+ value: 37.0248
161
+ verified: true
162
+ - name: ROUGE-2
163
+ type: rouge
164
+ value: 9.0446
165
+ verified: true
166
+ - name: ROUGE-L
167
+ type: rouge
168
+ value: 18.0521
169
+ verified: true
170
+ - name: ROUGE-LSUM
171
+ type: rouge
172
+ value: 33.4723
173
+ verified: true
174
+ - name: loss
175
+ type: loss
176
+ value: 3.381495237350464
177
+ verified: true
178
+ - name: gen_len
179
+ type: gen_len
180
+ value: 211.2066
181
+ verified: true
182
  ---
183
 
184
  # long-t5-tglobal-large-pubmed-3k-booksum-16384-WIP