PKU-Alignment

university

https://github.com/PKU-Alignment

PKU-Alignment

Activity Feed

AI & ML interests

Reinforcement Learning, Large Language Models, Value Alignment

Recent Activity

alignmentforever updated a dataset 22 days ago

PKU-Alignment/InterMT

muchvo published a dataset about 2 months ago

PKU-Alignment/VLA-Arena-Scenes

XuehaiPan authored a paper about 2 months ago

AI Alignment: A Comprehensive Survey

View all activity

PKU-Alignment 's collections 6

InterMT

Alignment with Multi-turn Multimodal Understanding and Generation

PKU-Alignment/InterMT

Preview • Updated 22 days ago • 224
PKU-Alignment/InterMT-Bench-Images

Viewer • Updated May 23, 2025 • 1.51k • 1
PKU-Alignment/InterMT-Judge

Updated May 23, 2025 • 2

SafeSora

Towards Safety Alignment of Text2Video Generation

PKU-Alignment/SafeSora

Viewer • Updated Jun 20, 2024 • 51.7k • 110 • 7
PKU-Alignment/SafeSora-Eval

Viewer • Updated Jun 20, 2024 • 600 • 6 • 2
PKU-Alignment/SafeSora-Label

Viewer • Updated Jun 20, 2024 • 57.3k • 30 • 2
PKU-Alignment/SafeSora-Prompt

Viewer • Updated Aug 12, 2024 • 36.6k • 19

ProgressGym

Alignment with a millennium of moral progress

ProgressGym: Alignment with a Millennium of Moral Progress

Paper • 2406.20087 • Published Jun 28, 2024 • 4
Runtime error

4

ProgressGym LeaderBoard

🥇

4
PKU-Alignment/ProgressGym-HistText

Preview • Updated Aug 10, 2024 • 290 • 1
PKU-Alignment/ProgressGym-TimelessQA

Preview • Updated Aug 10, 2024 • 77 • 1

Language Model Resist Alignment

This repository hosts open-sourced models of "Language Model Resist Alignment" (ACL 2025 Main).

PKU-Alignment/Qwen1.5-0.5B-IMDB-Q1-10000

Updated May 31, 2025
PKU-Alignment/Qwen1.5-0.5B-IMDB-Q1-10000-Q2-100

Updated May 31, 2025
PKU-Alignment/Qwen1.5-0.5B-IMDB-Q1-10000-Q2-1000

Updated May 31, 2025 • 3
PKU-Alignment/Qwen1.5-0.5B-IMDB-Q1-10000-Q2-200

Updated May 31, 2025

PKU-SafeRLHF

A safety alignment preference dataset for llama family models

PKU-Alignment/PKU-SafeRLHF

Viewer • Updated Oct 18, 2024 • 164k • 7.51k • 173
PKU-Alignment/PKU-SafeRLHF-single-dimension

Viewer • Updated Jun 14, 2024 • 81.1k • 108 • 3
PKU-Alignment/PKU-SafeRLHF-QA

Viewer • Updated Jun 14, 2024 • 265k • 225 • 7
PKU-Alignment/PKU-SafeRLHF-prompt

Viewer • Updated Jun 14, 2024 • 44.6k • 157 • 5

Align-Anything

PKU-Alignment/align-anything

Viewer • Updated Apr 5, 2025 • 69.4k • 1.67k • 47
PKU-Alignment/Align-Anything-Instruction-100K-zh

Viewer • Updated Oct 10, 2024 • 105k • 118 • 8
PKU-Alignment/Align-Anything-Instruction-100K

Viewer • Updated Oct 10, 2024 • 105k • 76 • 9
PKU-Alignment/Align-Anything-TI2T-Instruction-100K

Viewer • Updated Nov 20, 2024 • 103k • 185 • 1