virtuoussy's Collections
RLVR
Updated 16 days ago
Model and data for 'Expanding RL with Verifiable Rewards Across Diverse Domains'
virtuoussy/Qwen2.5-7B-Instruct-RLVR • Updated 14 days ago • 107 • 11
virtuoussy/Math-RLVR • Viewer • Updated 14 days ago • 782k • 283 • 6
virtuoussy/Multi-subject-RLVR • Viewer • Updated 14 days ago • 579k • 1.03k • 51