Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
PKU-Alignment
/
beaver-7b-v1.0
like
10
Follow
PKU-Alignment
36
Reinforcement Learning
Safetensors
PKU-Alignment/PKU-SafeRLHF
English
safe-rlhf
llama
reinforcement-learning-from-human-feedback
rlhf
safety
ai-safety
deepspeed
beaver
alpaca
arxiv:
2302.13971
arxiv:
2307.04657
arxiv:
2310.12773
Model card
Files
Files and versions
Train
main
beaver-7b-v1.0
Commit History
Update README.md
77e66d2
verified
XuehaiPan
commited on
May 9
Update README.md
ddd8b5a
XuehaiPan
commited on
Apr 20
Convert model checkpoint to safetensors
c077f71
XuehaiPan
commited on
Apr 19
Update example usage
7c280b0
XuehaiPan
commited on
Dec 18, 2023
Update model card
f022918
XuehaiPan
commited on
Jul 17, 2023
Update README.md
f72580f
RuiyangSun
commited on
Jul 12, 2023
docs: update readme
1071a1a
RuiyangSun
commited on
Jul 10, 2023
docs: update README
579479d
XuehaiPan
commited on
Jul 10, 2023
chore: update metadata
e08f20c
RuiyangSun
commited on
Jul 10, 2023
hello beaver-7b-v1.0
d586bac
RuiyangSun
commited on
Jul 7, 2023
initial commit
3076c4e
RuiyangSun
commited on
Jun 24, 2023