Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Edit Models filters

Model Tree
lblaoke/opt-125m-hh-rlhf-chosen-sft-trl-v5
Finetunes
Inference Providers
Cohere
Nebius AI Studio
Fireworks
Novita
Together AI
Hyperbolic
fal
SambaNova
Replicate
Nscale
Featherless AI
Cerebras
HF Inference API
Misc
Inference Endpoints
text-generation-inference
Eval Results
Merge
4-bit precision
custom_code
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts

Models

2
Full-text search
Active filters: lblaoke/opt-125m-hh-rlhf-chosen-sft-trl-v5

lblaoke/opt-125m-hh-rlhf-dpo-trl-v5

Updated May 8 • 18

lblaoke/opt-125m-hh-rlhf-rm-trl-v5

Updated May 9 • 5
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs