Update README.md
README.md
CHANGED
@@ -6,4 +6,6 @@ base_model:
 - Qwen/Qwen3-0.6B
 pipeline_tag: summarization
 library_name: peft
----
+---
+
+Source code available at https://github.com/phhusson/llm-rl/blob/main/grpo-tldr.py