Alignment with Multi-turn Multimodal Understanding and Generation
AI & ML interests
Reinforcement Learning, Large Language Models, Value Alignment
This repository hosts the open-source models for "Language Models Resist Alignment" (ACL 2025 Main).
A safety alignment preference dataset for Llama-family models
- PKU-Alignment/PKU-SafeRLHF
  Viewer • Updated • 164k • 5.86k • 154
- PKU-Alignment/PKU-SafeRLHF-single-dimension
  Viewer • Updated • 81.1k • 62 • 2
- PKU-Alignment/PKU-SafeRLHF-QA
  Viewer • Updated • 265k • 326 • 7
- PKU-Alignment/PKU-SafeRLHF-prompt
  Viewer • Updated • 44.6k • 134 • 5
- PKU-Alignment/align-anything
  Viewer • Updated • 69.4k • 1.43k • 41
- PKU-Alignment/Align-Anything-Instruction-100K-zh
  Viewer • Updated • 105k • 186 • 8
- PKU-Alignment/Align-Anything-Instruction-100K
  Viewer • Updated • 105k • 86 • 9
- PKU-Alignment/Align-Anything-TI2T-Instruction-100K
  Viewer • Updated • 103k • 232 • 1
Towards Safety Alignment of Text2Video Generation
Alignment with a millennium of moral progress