FinancialSupport commited on
Commit
74d0cd6
1 Parent(s): 0ab07f7

Update README.md

Browse files

added evaluation :)

Files changed (1) hide show
  1. README.md +7 -2
README.md CHANGED
@@ -121,9 +121,14 @@ model = AutoModelForCausalLM.from_pretrained(model_id, load_in_8bit=True)
121
 
122
  ## Evaluation
123
 
124
- <!-- This section describes the evaluation protocols and provides the results. -->
 
 
 
 
 
 
125
 
126
- *Coming soon*!
127
 
128
  ## Citation
129
 
 
121
 
122
  ## Evaluation
123
 
124
+ For a detailed comparison of model performance, check out the [Leaderboard for Italian Language Models](https://huggingface.co/spaces/FinancialSupport/open_ita_llm_leaderboard).
125
+
126
+ Here's a breakdown of the performance metrics:
127
+ | Metric | hellaswag_it acc_norm | arc_it acc_norm | m_mmlu_it 5-shot acc | Average |
128
+ |:----------------------------|:----------------------|:----------------|:---------------------|:--------|
129
+ | **Accuracy Normalized** | 0.6566 | 0.5004 | 0.6084 | 0.588 |
130
+
131
 
 
132
 
133
  ## Citation
134