Hugging Face
TaolinZhang
iridescentttt
1 follower · 3 following
AI & ML interests
None yet
Recent Activity
upvoted a paper 6 days ago
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification
upvoted a paper 6 days ago
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens
upvoted a paper 8 days ago
CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward
Organizations
Papers (2)
arxiv:2507.06920
arxiv:2507.06138
models (0)
None public yet
datasets (1)
iridescentttt/vtab-1k-png
Updated Sep 20, 2024 • 1