mradermacher/gpt2-rlhf-anthropic-GGUF

reinforcement-learning-from-human-feedback

anthropic-hh-rlhf

chatgpt-style-training

supervised-fine-tuning

human-preferences

Welcome to the community

The community tab is the place to discuss and collaborate with the Hugging Face community.