Hugging Face
RLHFlow's Collections
Decision-Tree Reward Models
Updated 2 days ago
RLHFlow/Decision-Tree-Reward-Gemma-2-27B • Text Classification • Updated 6 days ago • 54 • 1
RLHFlow/Decision-Tree-Reward-Llama-3.1-8B • Text Classification • Updated 6 days ago • 107
RLHFlow/LLM-Preferences-HelpSteer2 • Updated 3 days ago • 8