Commit e275da7 · 1 Parent(s): 034ac91
update url to scoring function
dabstep_benchmark/content.py CHANGED
@@ -22,7 +22,7 @@ Mandatory fields are: `task_id` and `agent_answer`. However, `reasoning_trace` i
 {"task_id": "task_id_2", "agent_answer": "Answer 2 from your agent", "reasoning_trace": "The different steps by which your model reached answer 2"}
 ```
 
-Our scoring function can be found [here](https://huggingface.co/spaces/adyen/data-agents-benchmark/blob/main/
+Our scoring function can be found [here](https://huggingface.co/spaces/adyen/data-agents-benchmark/blob/main/dabstep_benchmark/evaluation/scorer.py).
 """
 
 CITATION_BUTTON_LABEL = "Copy the following snippet to cite these results"
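The hunk header names `task_id` and `agent_answer` as mandatory fields, and the context line shows one JSON object per line. Below is a minimal sketch of a pre-submission check under those assumptions. The function name `validate_submission_lines` and the `REQUIRED_FIELDS` constant are hypothetical and are not part of the benchmark code; the linked `scorer.py` is the authoritative reference and may apply different checks.

```python
import json

# Mandatory fields as named in the hunk header; reasoning_trace is not checked here.
REQUIRED_FIELDS = ("task_id", "agent_answer")


def validate_submission_lines(lines):
    """Return a list of (line_number, problem) tuples; an empty list means no problems were found."""
    problems = []
    for number, raw in enumerate(lines, start=1):
        raw = raw.strip()
        if not raw:
            continue  # ignore blank lines
        try:
            record = json.loads(raw)
        except json.JSONDecodeError as exc:
            problems.append((number, f"invalid JSON: {exc.msg}"))
            continue
        if not isinstance(record, dict):
            problems.append((number, "expected a JSON object"))
            continue
        for field in REQUIRED_FIELDS:
            if field not in record:
                problems.append((number, f"missing field: {field}"))
    return problems


if __name__ == "__main__":
    sample = [
        '{"task_id": "task_id_2", "agent_answer": "Answer 2 from your agent"}',
        '{"task_id": "task_id_3"}',
    ]
    print(validate_submission_lines(sample))  # [(2, 'missing field: agent_answer')]
```

Running a check like this before uploading catches malformed lines early; it does not score answers.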