vwxyzjn commited on
Commit
10509a2
·
verified ·
1 Parent(s): 24fe4c5

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -45,10 +45,10 @@ Tülu3 is designed for state-of-the-art performance on a diversity of tasks in a
45
  |-----------|-------------------|
46
  | **Base Model** | [meta-llama/llama-3.1-405B](https://huggingface.co/meta-llama/llama-3.1-405B) |
47
  | **SFT** | [allenai/llama-3.1-Tulu-3-405B-SFT](https://huggingface.co/allenai/llama-3.1-Tulu-3-405B-SFT) |
48
- | **Final Model (DPO)** | [allenai/llama-3.1-Tulu-3-405B](https://huggingface.co/allenai/llama-3.1-Tulu-3-405B) |
 
49
  | **Reward Model (RM)**| (Same as 8B)
50
 
51
-
52
  ## Using the model
53
 
54
  ### Loading with HuggingFace
 
45
  |-----------|-------------------|
46
  | **Base Model** | [meta-llama/llama-3.1-405B](https://huggingface.co/meta-llama/llama-3.1-405B) |
47
  | **SFT** | [allenai/llama-3.1-Tulu-3-405B-SFT](https://huggingface.co/allenai/llama-3.1-Tulu-3-405B-SFT) |
48
+ | **DPO** | [allenai/llama-3.1-Tulu-3-405B-DPO](https://huggingface.co/allenai/llama-3.1-Tulu-3-405B-DPO) |
49
+ | **Final Model (RLVR)** | [allenai/llama-3.1-Tulu-3-405B](https://huggingface.co/allenai/llama-3.1-Tulu-3-405B) |
50
  | **Reward Model (RM)**| (Same as 8B)
51
 
 
52
  ## Using the model
53
 
54
  ### Loading with HuggingFace