Tool-Augmented Reward Models

updated 19 days ago

[ICLR'24 Spotlight] Tool-Augmented Reward Modeling

  • Tool-Augmented Reward Modeling

    Paper • 2310.01045 • Published Oct 2, 2023 • 2

  • ernie-research/TARA

    Preview • Updated 3 days ago • 95 • 1

  • ernie-research/Themis-7b

    Updated 3 days ago • 32 • 4