Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
5
8
guox18
guox18
Follow
0 followers
ยท
4 following
guox18
AI & ML interests
Alignment
Recent Activity
upvoted
a
paper
about 1 month ago
Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning
authored
a paper
about 1 month ago
Consensus Entropy: Harnessing Multi-VLM Agreement for Self-Verifying and Self-Improving OCR
authored
a paper
about 1 month ago
IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards
View all activity
Organizations
None yet
guox18
's datasets
1
Sort:ย Recently updated
guox18/IFDecorator
Preview
โข
Updated
Aug 8
โข
103
โข
1