qgyd2021
/

reward_model_gpt2_stack_exchange

Text Generation

Model card Files Files and versions Community

qgyd2021 commited on Sep 29, 2023

Commit

b86d6be

·

1 Parent(s): d7efd4b

Update README.md

Files changed (1) hide show

README.md +9 -1

README.md CHANGED Viewed

@@ -8,4 +8,12 @@ library_name: adapter-transformers
 pipeline_tag: text-generation
 tags:
 - reward_model
----

 pipeline_tag: text-generation
 tags:
 - reward_model
+---
+## Reward Model GPT2
+fine-tuned [GPT2](https://huggingface.co/gpt2) to a reward model.
+The model is designed to generate human-like responses to questions in [Stack Exchange](https://huggingface.co/datasets/lvwerra/stack-exchange-paired) domains of programming, mathematics, physics, and more.
+For more info check out the blog post and github [example](https://github.com/huggingface/trl/tree/main/examples/research_projects/stack_llama_2/scripts).