Ganqu Cui
ganqu
21 followers · 2 following
cgq15
AI & ML interests
None yet
Recent Activity
authored a paper 1 day ago
MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe
authored a paper 7 days ago
FlowRL: Matching Reward Distributions for LLM Reasoning
upvoted a paper 14 days ago
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
Organizations
Articles
1
Article
29
Process Reinforcement through Implicit Rewards
Papers
18
arxiv:2509.15207
arxiv:2509.18154
arxiv:2505.22617
arxiv:2504.16084
models
0
None public yet
datasets
1
ganqu/openbackdoor
Preview • Updated Oct 23, 2024 • 31