TEEN-D's Collections
HallOumi GRPO
Reinforcement Learning
Updated 23 days ago
HallOumi training data prepared for a GRPO trainer.
TEEN-D/grpo-oumi-anli-subset • Updated 23 days ago • 21.1k • 122
TEEN-D/grpo-oumi-c2d-d2c-subset • Updated 23 days ago • 14.4k • 91
TEEN-D/grpo-oumi-synthetic-claims • Updated 23 days ago • 19.2k • 67
TEEN-D/grpo-oumi-synthetic-document-claims • Updated 23 days ago • 8.4k • 91