Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
Ao Qu
PRO
quao627
Follow
0 followers
·
2 following
quao627
AI & ML interests
RL, LLM, Video Generation, RecSYS, Data Mining
Recent Activity
updated
a model
about 18 hours ago
Mem-Lab/Qwen2.5-7B-RL-RAG-Q2-step-160
published
a model
about 19 hours ago
Mem-Lab/Qwen2.5-7B-RL-RAG-Q2-step-160
upvoted
a
paper
22 days ago
Web-Shepherd: Advancing PRMs for Reinforcing Web Agents
View all activity
Organizations
Papers
2
arxiv:
2506.02242
arxiv:
2202.03630
models
1
quao627/nq-search-r1-ppo-qwen2.5-base-3b
Updated
Apr 24
•
13
•
1
datasets
3
Sort: Recently updated
quao627/hotpotqa-bm25-dev-500
Updated
Apr 15
•
8
quao627/list_functions_meta
Viewer
•
Updated
Mar 19
•
80
•
34
quao627/list_functions
Viewer
•
Updated
Mar 19
•
6k
•
53