Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
HiTZ 's Collections
Whisper
Latxa Instruct
Latxa
Multilingual TruthfulQA
GoLLIE
Ask2Transformers
Metaphor Processing
MATE
EusCrawl
BERnaT
Alpaca LoRA MT
Lemmatization
Pretraining Datasets
Evaluation Datasets
Instruction Datasets
Basque Encoders
OPT RM
Composite Corpus
Medical-mT5
Lessons in Evaluation of Spanish Encoder-only Models
BasqueParl
This is not a dataset
Speech to Text
CONAN-EUS: Counternarrative Generation in Basque and Spanish
EriBERTa
BERTeus
IXAmBERT
Antidote Project
Machine Translation
XNLIeu
Odesia Challenge 2024
Medical MT

OPT RM

updated 22 days ago

OPT reward models

Upvote
-

  • Training Language Models with Language Feedback at Scale

    Paper • 2303.16755 • Published Mar 28, 2023 • 1

  • HiTZ/lmloss-opt-rm-1.3b

    Text Generation • Updated Apr 7, 2023 • 57

  • HiTZ/rmloss-opt-rm-13b

    Text Generation • Updated Apr 7, 2023 • 19
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs