PKU-Alignment/beaver-7b-v3.0-reward

Reinforcement Learning

reinforcement-learning-from-human-feedback

Welcome to the community

The community tab is the place to discuss and collaborate with the Hugging Face community!