Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4
10
Zhiyuan Hu
zhiyuanhucs
Follow
21world's profile picture
John6666's profile picture
cs-fxr's profile picture
4 followers
·
0 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 15 hours ago
GTA1: GUI Test-time Scaling Agent
upvoted
a
paper
30 days ago
Reinforcement Pre-Training
updated
a model
about 1 month ago
zhiyuanhucs/JudgeLRM-14B-reward-wo-score-step400
View all activity
Organizations
None yet
Papers
4
arxiv:
2505.10554
arxiv:
2504.00050
arxiv:
2411.14251
arxiv:
2409.00509
spaces
1
Running
1
Meta Ability
ðŸ§
Enhance reasoning in large models by aligning meta-abilities
models
102
Sort:Â Recently updated
zhiyuanhucs/JudgeLRM-14B-reward-wo-score-step400
15B
•
Updated
about 1 month ago
•
5
zhiyuanhucs/JudgeLRM-14B-reward-w-length-step400
15B
•
Updated
about 1 month ago
•
5
zhiyuanhucs/JudgeLRM-14B-reward-step1400
Updated
about 1 month ago
•
4
zhiyuanhucs/JudgeLRM-14B-reward-step1200
15B
•
Updated
about 1 month ago
•
6
zhiyuanhucs/JudgeLRM-14B-reward-step1000
15B
•
Updated
about 1 month ago
•
5
zhiyuanhucs/JudgeLRM-14B-reward-step800
15B
•
Updated
about 1 month ago
•
5
zhiyuanhucs/JudgeLRM-14B-reward-step600
15B
•
Updated
about 1 month ago
•
5
zhiyuanhucs/JudgeLRM-14B-reward-wo-score-step200
15B
•
Updated
about 1 month ago
•
4
zhiyuanhucs/JudgeLRM-14B-reward-w-length-step200
15B
•
Updated
about 1 month ago
•
4
zhiyuanhucs/JudgeLRM-14B-reward-step400
15B
•
Updated
about 1 month ago
•
3
View 102 models
datasets
3
Sort:Â Recently updated
zhiyuanhucs/amc23_dup32
Viewer
•
Updated
May 7
•
1.28k
•
20
zhiyuanhucs/AIME24_dup32
Viewer
•
Updated
May 7
•
960
•
24
zhiyuanhucs/AIME_1983_2024
Viewer
•
Updated
Mar 3
•
933
•
36