Update README.md
Browse files
README.md
CHANGED
@@ -30,7 +30,14 @@ GOVERNING TERMS: Use of this model is governed by the [NVIDIA Open Model License
|
|
30 |
|
31 |
## Arena Hard LeaderBoard
|
32 |
|
33 |
-
As of 18 Mar 2025, augmenting models with the Feedback-Edit Inference Time Scaling (ITS) approach
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
34 |
|
35 |
| Model | Arena Hard (95% CI) |
|
36 |
|:-----------------------------|:----------------|
|
|
|
30 |
|
31 |
## Arena Hard LeaderBoard
|
32 |
|
33 |
+
As of 18 Mar 2025, augmenting models with the Feedback-Edit Inference Time Scaling (ITS) approach leads to the highest performance on Arena Hard.
|
34 |
+
|
35 |
+
The Feedback-Edit Inference Time Scaling system comprise of the following models:
|
36 |
+
|
37 |
+
1. [Llama-3.3-Nemotron-70B-Feedback](https://huggingface.co/nvidia/Llama-3.3-Nemotron-70B-Feedback)
|
38 |
+
2. [Llama-3.3-Nemotron-70B-Edit](https://huggingface.co/nvidia/Llama-3.3-Nemotron-70B-Edit)
|
39 |
+
3. [Llama-3.3-Nemotron-70B-Select](https://huggingface.co/nvidia/Llama-3.3-Nemotron-70B-Select)
|
40 |
+
|
41 |
|
42 |
| Model | Arena Hard (95% CI) |
|
43 |
|:-----------------------------|:----------------|
|