ulab-ai's Collections
ResearchTown
Sotopia-RL
FusionFactory
IRanker
Router-R1
Time-R1
Sotopia-RL
Updated 9 days ago
Sotopia-RL: Reward Design for Social Intelligence
ulab-ai/sotopia-rl-qwen-2.5-7B-grpo • Updated Jun 7 • 4
ulab-ai/sotopia-rl-reward-annotation • Viewer • Updated 8 days ago • 7.57k • 174 • 1
ulab-ai/sotopia-rl-qwen2.5-7B-rm • Feature Extraction • Updated 8 days ago • 1