mradermacher
/

gpt2-rlhf-anthropic-GGUF

reinforcement-learning-from-human-feedback

anthropic-hh-rlhf

chatgpt-style-training

supervised-fine-tuning

human-preferences

Model card Files Files and versions

gpt2-rlhf-anthropic-GGUF / gpt2-rlhf-anthropic.Q3_K_M.gguf

Commit History

uploaded from leia

c5dd94b
verified

mradermacher commited on 9 days ago