autoevaluator HF staff commited on
Commit
bb6e97e
·
1 Parent(s): 534ca44

Add evaluation results on the default config and test split of xsum

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the default config and test split of the [xsum](https://huggingface.co/datasets/xsum) dataset by

@zuzannad1
, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-xsum-default-e3e096-60495145411).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=xsum).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=xsum).

Files changed (1) hide show
  1. README.md +8 -8
README.md CHANGED
@@ -58,25 +58,25 @@ model-index:
58
  split: test
59
  metrics:
60
  - type: rouge
61
- value: 26.8921
62
  name: ROUGE-1
63
  verified: true
64
- verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZmFkMTFiNmM3YmRkZDk1Y2FhM2EwOTdiYmUwYjBhMGEzZmIyZmIwNWI5OTVmY2U0N2QzYzgxYzM0OTEzMjFjNSIsInZlcnNpb24iOjF9.fOq4zI_BWvTLFJFQOWNk3xEsDIu3aAeboGYPw5TiBqdJJjvdyKmLbfj2WVnNboWbrmp1PuL01iJjTi2Xj6PUAA
65
  - type: rouge
66
- value: 6.9411
67
  name: ROUGE-2
68
  verified: true
69
- verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiMTBlZmI3NjQ3M2JiYzI4MTg3YmJkMjg0ZmE5MDUwNzljNTYyM2M0NzA3YTNiNTA2Nzk4MDhhYWZjZjgyMmE1MCIsInZlcnNpb24iOjF9.rH0DY2hMz2rXaK29vkt7xah-3G95rY4MOS2oVKjXmw4TijB-ZVytfLJAlBmyqA8HYAythRCywmLSjjCDWc66Cg
70
  - type: rouge
71
- value: 21.2832
72
  name: ROUGE-L
73
  verified: true
74
- verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiODAwZDYzNTc0NjZhNzNiMDE2ZDY2NjNjNmViNTc0NDVjNTZkYjljODhmYmNiMWFhY2NkZjU5MzQ0NmM0OTcyMSIsInZlcnNpb24iOjF9.5duHtdjZ8dwtbp1HKyMR4mVK9IIlEZvuWGjQMErpE7VNyKPhMOT6Avh_vXFQz6q_jBzqpZGGREho1mt50yBsDw
75
  - type: rouge
76
- value: 21.284
77
  name: ROUGE-LSUM
78
  verified: true
79
- verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiMGQ2NmNhZTZmZDFkNTcyYjQ4MjhhYWJhODY1ZGRjODY2ZTE5MmRmZDRlYTk4NWE4YWM1OWY2M2NjOWQ3YzU0OCIsInZlcnNpb24iOjF9.SJ8xTcAVWrRDmJmQoxE1ADIcdGA4tr3V04Lv0ipMJiUksCdNC7FO8jYbjG9XmiqbDnnr5h4XoK4JB4-GsA-gDA
80
  - type: loss
81
  value: 2.5411810874938965
82
  name: loss
 
58
  split: test
59
  metrics:
60
  - type: rouge
61
+ value: 26.8918
62
  name: ROUGE-1
63
  verified: true
64
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYjRmMjlkY2JiNTFkZTMyYmE0ZGRkYjc4MzlkMTY5ZTNlNzFiYzc2NjlhZDRmZDZjOTY2NGI1OWY0ZjI2NmQzZCIsInZlcnNpb24iOjF9.CZDLCP9IwNbTI1jqpBw9-VptBsWsq89Ye6P2GbNxXjprfUSXf4txyL_wqseA2K71UU331Y5kNREKmgrPbRuWDw
65
  - type: rouge
66
+ value: 6.9481
67
  name: ROUGE-2
68
  verified: true
69
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiOTY5ZjgwMDAwMjAxYWYyZWNiOTk2MWQ1MDEzYmU3NzMyZjgzZWE4NzU1MDA2ZmIzNjQ1NDlkOWEzOTcwN2VhNCIsInZlcnNpb24iOjF9.fcsCcrO2AhepIO6p49Xl9cBHlBPgVeYLyidffcDnFr9A4mfe19rxbLmG9q9ZsK52SlhLF1c5q47Fl7iOevPGAA
70
  - type: rouge
71
+ value: 21.2908
72
  name: ROUGE-L
73
  verified: true
74
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiMmFhMTkwZTgxZDk0MjM1ZTg2ZDBmOTRkZTNjNjIwZWMwNDM5YTAzMWU3YzUxZWJlMTY5ZWQ4MGM0OGU3MWYyZCIsInZlcnNpb24iOjF9.H3vo2Qf7mT244VPTSDepZxqmupsV13sOCK3Ldvt-49khliGtncmnu9zmJT4J9Y11-1YFpl4hs_V4ooUXQp67Cg
75
  - type: rouge
76
+ value: 21.2827
77
  name: ROUGE-LSUM
78
  verified: true
79
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNzhhMzIzNThkMGJjNWMwZWIxYTczNDNmMzI3ZTAzMWI0NDg1ZWIwMTFhNGEzMTJjYTYyYmJmYTNjODQzODUwZCIsInZlcnNpb24iOjF9.NfbiSoRGRvhEnETkpgRMfrHHou7C1_v9jXbfO10H9JZVRaVAr8lpQ8jvHnf9Rpr4QGFo3ZDMDRnjxDZT5W8VBw
80
  - type: loss
81
  value: 2.5411810874938965
82
  name: loss