Update README.md
README.md
CHANGED
@@ -18,6 +18,8 @@ This model is based on **Qwen2.5-3B-Instruct** and trained with **PPO (Proximal
 
 Github: https://github.com/lichengliu03/unary-feedback
 
+Website: https://unary-feedback.github.io/
+
 ## Model Info
 
 - **Base model**: Qwen/Qwen2.5-3B-Instruct