Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
allenai 's Collections
OLMo 2
olmOCR
DataDecide
OLMoE (January 2025)
PixMo
Tulu 3 Models
Tulu 3 Datasets
Molmo
OLMoE (November 2024)
OLMo Suite
Tulu V2.5 Suite
Reward Bench
Paloma
Tulu V2 Suite
WildBench
SciRIFF
AI2 Safety Toolkit
Zebra Logic Bench
OLMo 2 Preview Post-trained Models
ACE

Reward Bench

updated 9 days ago

Datasets, spaces, and models for the reward model benchmark!

Upvote
9

  • Running
    364
    364

    Reward Bench Leaderboard

    📐

    Explore and analyze RewardBench leaderboard data


  • allenai/reward-bench

    Viewer • Updated Sep 9, 2024 • 8.11k • 7.59k • 94

  • allenai/preference-test-sets

    Viewer • Updated Mar 14, 2024 • 43.2k • 1.08k • 25

  • allenai/reward-bench-results

    Updated 2 days ago • 2.39k • 2

  • RewardBench: Evaluating Reward Models for Language Modeling

    Paper • 2403.13787 • Published Mar 20, 2024 • 23
Upvote
9
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs