rm-robustness's Collections
[ICML 2025] Robustness in RMs

updated May 27

Datasets and reward models for "On the Robustness of Reward Models for Language Model Alignment" (ICML 2025)

  • rm-robustness/ultrafeedback-train

    Viewer • Updated May 11 • 51.2k • 8

  • rm-robustness/ultrafeedback-valid-1-in-domain

    Viewer • Updated May 11 • 51.2k • 8

  • rm-robustness/ultrafeedback-valid-2-prompt-ood

    Viewer • Updated May 11 • 11.1k • 6

  • rm-robustness/ultrafeedback-valid-3-response-ood

    Viewer • Updated May 11 • 51.2k • 3

  • rm-robustness/ultrafeedback-valid-4-mutual-ood

    Viewer • Updated May 11 • 11.1k • 9

  • rm-robustness/L31-8B-SKPv2-BSR-1e2

    Text Classification • 8B • Updated May 11 • 3

  • rm-robustness/L31-8B-SKPv2-BSR-1e3

    Text Classification • 8B • Updated May 11 • 3

  • rm-robustness/L31-8B-SKPv2-BSR-1e4

    Text Classification • 8B • Updated May 11 • 8
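
The repositories listed above can presumably be pulled with the standard Hugging Face tooling. The following is a minimal sketch, not the authors' own loading code: the repository IDs are copied from the listing, while the helper name `load_collection_dataset` is hypothetical, and the sketch assumes the `datasets` package is installed and the repositories are publicly accessible. Split and column names are not stated on this page, so none are assumed.

```python
# Repository IDs copied verbatim from the collection listing.
DATASETS = [
    "rm-robustness/ultrafeedback-train",
    "rm-robustness/ultrafeedback-valid-1-in-domain",
    "rm-robustness/ultrafeedback-valid-2-prompt-ood",
    "rm-robustness/ultrafeedback-valid-3-response-ood",
    "rm-robustness/ultrafeedback-valid-4-mutual-ood",
]

REWARD_MODELS = [
    "rm-robustness/L31-8B-SKPv2-BSR-1e2",
    "rm-robustness/L31-8B-SKPv2-BSR-1e3",
    "rm-robustness/L31-8B-SKPv2-BSR-1e4",
]


def load_collection_dataset(repo_id):
    """Download one dataset of the collection from the Hub.

    Hypothetical helper: needs the `datasets` package and network access.
    """
    if repo_id not in DATASETS:
        raise ValueError(f"{repo_id!r} is not a dataset in this collection")
    # Imported lazily so the ID lists above stay usable without `datasets`.
    from datasets import load_dataset

    # No split is requested, so every available split is returned together
    # and no split name has to be guessed.
    return load_dataset(repo_id)
```

Calling `load_collection_dataset(DATASETS[0])` and printing the result should show which splits and columns the training set actually has. The three models are tagged "Text Classification", so loading them through `transformers.AutoModelForSequenceClassification.from_pretrained` is a plausible starting point, though that is an assumption the listing does not confirm.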