Update README.md
Browse files
README.md
CHANGED
@@ -8,4 +8,12 @@ library_name: adapter-transformers
|
|
8 |
pipeline_tag: text-generation
|
9 |
tags:
|
10 |
- reward_model
|
11 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
8 |
pipeline_tag: text-generation
|
9 |
tags:
|
10 |
- reward_model
|
11 |
+
---
|
12 |
+
## Reward Model GPT2
|
13 |
+
|
14 |
+
fine-tuned [GPT2](https://huggingface.co/gpt2) to a reward model.
|
15 |
+
|
16 |
+
The model is designed to generate human-like responses to questions in [Stack Exchange](https://huggingface.co/datasets/lvwerra/stack-exchange-paired) domains of programming, mathematics, physics, and more.
|
17 |
+
|
18 |
+
For more info check out the blog post and github [example](https://github.com/huggingface/trl/tree/main/examples/research_projects/stack_llama_2/scripts).
|
19 |
+
|