Add evaluation results on the plain_text config and test split of launch/gov_report

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the plain_text config and test split of the [launch/gov_report](https://huggingface.co/datasets/launch/gov_report) dataset by @nonchalant-nagavalli, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-launch__gov_report-plain_text-2fa37c-16136228).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=launch/gov_report).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=launch/gov_report).
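
Before accepting, a reviewer may want to sanity-check the metric entries this pull request adds. The following is a minimal sketch, not part of the evaluator or any Hub API: `added_metrics` and `check_metrics` are hypothetical names, the values are copied by hand from the diff in this pull request, and the 0 to 100 range for ROUGE is an assumption about how the scores are scaled.

```python
# Metric entries copied by hand from the model-index block added in this PR.
# `added_metrics` and `check_metrics` are illustrative names, not a Hub API.
added_metrics = [
    {"name": "ROUGE-1", "type": "rouge", "value": 37.0248, "verified": True},
    {"name": "ROUGE-2", "type": "rouge", "value": 9.0446, "verified": True},
    {"name": "ROUGE-L", "type": "rouge", "value": 18.0521, "verified": True},
    {"name": "ROUGE-LSUM", "type": "rouge", "value": 33.4723, "verified": True},
    {"name": "loss", "type": "loss", "value": 3.381495237350464, "verified": True},
    {"name": "gen_len", "type": "gen_len", "value": 211.2066, "verified": True},
]

REQUIRED_KEYS = {"name", "type", "value", "verified"}


def check_metrics(metrics):
    """Return a list of problems found; an empty list means none were found."""
    problems = []
    seen_names = set()
    for entry in metrics:
        label = entry.get("name", "<unnamed>")
        missing = REQUIRED_KEYS - entry.keys()
        if missing:
            problems.append(f"{label}: missing keys {sorted(missing)}")
            continue
        if label in seen_names:
            problems.append(f"{label}: duplicate metric name")
        seen_names.add(label)
        # Assumption: ROUGE scores are reported as percentages (0 to 100).
        if entry["type"] == "rouge" and not 0.0 <= entry["value"] <= 100.0:
            problems.append(f"{label}: ROUGE value {entry['value']} out of range")
    return problems


print(check_metrics(added_metrics))  # prints []
```

The check is purely structural: it confirms that each entry carries the four keys used in the diff and that no metric name is repeated. It does not recompute any score.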

README.md +33 -0

@@ -146,6 +146,39 @@ model-index:
       type: gen_len
       value: 139.0398
       verified: true
+  - task:
+      type: summarization
+      name: Summarization
+    dataset:
+      name: launch/gov_report
+      type: launch/gov_report
+      config: plain_text
+      split: test
+    metrics:
+    - name: ROUGE-1
+      type: rouge
+      value: 37.0248
+      verified: true
+    - name: ROUGE-2
+      type: rouge
+      value: 9.0446
+      verified: true
+    - name: ROUGE-L
+      type: rouge
+      value: 18.0521
+      verified: true
+    - name: ROUGE-LSUM
+      type: rouge
+      value: 33.4723
+      verified: true
+    - name: loss
+      type: loss
+      value: 3.381495237350464
+      verified: true
+    - name: gen_len
+      type: gen_len
+      value: 211.2066
+      verified: true
 ---
 # long-t5-tglobal-large-pubmed-3k-booksum-16384-WIP