Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ulab-ai 's Collections
ResearchTown
Sotopia-RL
FusionFactory
IRanker
Router-R1
Time-R1

Sotopia-RL

updated 9 days ago

Sotopia-RL: Reward Design for Social Intelligence

Upvote
-

  • ulab-ai/sotopia-rl-qwen-2.5-7B-grpo

    Updated Jun 7 • 4

  • ulab-ai/sotopia-rl-reward-annotation

    Viewer • Updated 8 days ago • 7.57k • 174 • 1

  • ulab-ai/sotopia-rl-qwen2.5-7B-rm

    Feature Extraction • Updated 8 days ago • 1
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs