qgyd2021 commited on
Commit
b86d6be
·
1 Parent(s): d7efd4b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -1
README.md CHANGED
@@ -8,4 +8,12 @@ library_name: adapter-transformers
8
  pipeline_tag: text-generation
9
  tags:
10
  - reward_model
11
- ---
 
 
 
 
 
 
 
 
 
8
  pipeline_tag: text-generation
9
  tags:
10
  - reward_model
11
+ ---
12
+ ## Reward Model GPT2
13
+
14
+ fine-tuned [GPT2](https://huggingface.co/gpt2) to a reward model.
15
+
16
+ The model is designed to generate human-like responses to questions in [Stack Exchange](https://huggingface.co/datasets/lvwerra/stack-exchange-paired) domains of programming, mathematics, physics, and more.
17
+
18
+ For more info check out the blog post and github [example](https://github.com/huggingface/trl/tree/main/examples/research_projects/stack_llama_2/scripts).
19
+