RLHFlow

university
Activity Feed

AI & ML interests

Workflow of Reinforcement Learning from Human Feedback (RLHF). Blog: https://rlhflow.github.io/

Recent Activity

Chenlu123  updated a model 4 days ago
RLHFlow/Qwen2.5-7B-SFT
Chenlu123  updated a model 4 days ago
RLHFlow/Qwen2.5-7B-RAFT-Zero
Chenlu123  updated a model 4 days ago
RLHFlow/Qwen2.5-7B-DPO-NLL-Zero
View all activity