Update README.md
README.md
CHANGED
@@ -10,7 +10,9 @@ base_model:
 ---
 # Reverse Text Model Qwen3-0.6B
 
-Simple model that was RL FT for 20 steps / epochs after SFT to reverse text using [prime-rl](https://github.com/PrimeIntellect-ai/prime-rl/) and [reverse-text](https://github.com/PrimeIntellect-ai/prime-environments/tree/main/environments/reverse_text)
+Simple model that was RL FT for 20 steps / epochs after SFT to reverse text using [prime-rl](https://github.com/PrimeIntellect-ai/prime-rl/) and [reverse-text](https://github.com/PrimeIntellect-ai/prime-environments/tree/main/environments/reverse_text). See the improvement in results:
+
+
 
 ## Example Prompt & Reward
 