Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Edit Models filters

Inference Providers
Replicate
Cerebras
Fireworks
Nebius AI Studio
Cohere
SambaNova
Together AI
fal
Novita
Hyperbolic
HF Inference API
Misc
Inference Endpoints
reward
text-generation-inference

Misc with no match

Eval Results
Merge
4-bit precision
custom_code
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts

Models

12
Full-text search
Active filters: reward

TIGER-Lab/AceCodeRM-7B

Updated about 1 month ago • 156 • 4

TIGER-Lab/AceCodeRM-32B

Updated about 1 month ago • 29 • 7

li-jay-cs/test2-rlhf-rm-checkpoint

Updated Dec 21, 2023

li-jay-cs/gpt2-medium-rlhf-rm-checkpoint

Updated Dec 25, 2023 • 3

li-jay-cs/test3-rlhf-rm-checkpoint

Updated Dec 24, 2023

li-jay-cs/gpt2-rlhf-rm-checkpoint

Updated Dec 24, 2023

li-jay-cs/gpt2-training-full-rlhf-rm-checkpoint

Updated Dec 25, 2023 • 1

li-jay-cs/gpt2-last_token_reward_and_full_training-rlhf-rm-checkpoint

Updated Dec 25, 2023

li-jay-cs/1gpu-gpt2-myepoch1-gcp-reward-model

Updated Jan 12, 2024

thobauma/opt-350m

Text Classification • Updated Apr 25, 2024 • 4

ZhangNy/2024-11-18_10-58-28

Updated Nov 18, 2024 • 2

eth-nlped/Qwen2.5-1.5B-pedagogical-rewardmodel

Text Classification • Updated Mar 6 • 47.5k • 2
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs