Commit
·
08796a7
1
Parent(s):
5c2a615
Update benchmark table
Browse files
app.py
CHANGED
@@ -192,6 +192,7 @@ with gr.Blocks() as demo:
|
|
192 |
| Simple Bench | 🟠 42% |
|
193 |
| EMMA-Mini | 🟠 48% |
|
194 |
| PlanBench | 🟠 53% |
|
|
|
195 |
| GAIA | 🟡 65% |
|
196 |
| LiveBench Language | 🟡 65% |
|
197 |
| LiveBench Data Analysis | 🟡 71% |
|
|
|
192 |
| Simple Bench | 🟠 42% |
|
193 |
| EMMA-Mini | 🟠 48% |
|
194 |
| PlanBench | 🟠 53% |
|
195 |
+
| NYT Connections | 🟡 60% |
|
196 |
| GAIA | 🟡 65% |
|
197 |
| LiveBench Language | 🟡 65% |
|
198 |
| LiveBench Data Analysis | 🟡 71% |
|