mradermacher/gpt2-rlhf-implementation-GGUF

reinforcement-learning-from-human-feedback

anthropic-hh-rlhf

chatgpt-style-training

supervised-fine-tuning

human-preferences



Static GGUF quants of https://huggingface.co/Vibudhbh/gpt2-rlhf-implementation
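
This card does not list the individual quant files, so the sketch below lists them rather than assuming any file name. It is a minimal example, assuming the `huggingface_hub` Python package is installed. The helper names `gguf_files` and `list_quants` are hypothetical and exist only for illustration.

```python
from typing import Iterable, List

# Repository id taken from this page's title.
REPO_ID = "mradermacher/gpt2-rlhf-implementation-GGUF"


def gguf_files(filenames: Iterable[str]) -> List[str]:
    """Return the names that end in .gguf (case-insensitive), sorted."""
    return sorted(name for name in filenames if name.lower().endswith(".gguf"))


def list_quants(repo_id: str = REPO_ID) -> List[str]:
    """List the GGUF files in a Hub repository (hypothetical helper; needs network)."""
    # Imported lazily so gguf_files() works without huggingface_hub installed.
    from huggingface_hub import list_repo_files

    return gguf_files(list_repo_files(repo_id))
```

Calling `list_quants()` queries the Hub and returns whatever `.gguf` files the repository contains at that time. Any name it returns can then be passed as the `filename` argument of `huggingface_hub.hf_hub_download` to fetch that file.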