Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
6
10
2
Yuchen Zhuang
PRO
yczhuang
Follow
aindilis's profile picture
potato18z's profile picture
2 followers
·
4 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
4 days ago
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
upvoted
a
paper
6 days ago
Multiplayer Nash Preference Optimization
upvoted
a
paper
6 days ago
AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play
View all activity
Organizations
Papers
6
arxiv:
2505.07782
arxiv:
2502.06589
arxiv:
2310.13227
arxiv:
2306.15895
Expand 6 papers
models
7
Sort: Recently updated
yczhuang/IF-Embed-Gemma-Embedding-2B
2B
•
Updated
Apr 12
•
1
yczhuang/IF-Embed-Qwen-2.5-1.5B
2B
•
Updated
Apr 11
•
4
yczhuang/IF-Embed-Qwen-2.5-1.5B-Instruct
2B
•
Updated
Apr 8
•
3
yczhuang/webagent-7b-grpo-ckpt-400
8B
•
Updated
Apr 7
•
1
yczhuang/webagent-7b-grpo-ckpt-300
8B
•
Updated
Apr 7
•
1
yczhuang/webagent-7b-grpo-ckpt-200
8B
•
Updated
Apr 7
yczhuang/qwen-3b-sft-webagent
3B
•
Updated
Mar 20
•
4
datasets
4
Sort: Recently updated
yczhuang/Hephaestus-Forge
Viewer
•
Updated
28 days ago
•
3.81k
•
82
•
1
yczhuang/test
Updated
Jun 4
•
5
yczhuang/webagent-r1-distill
Viewer
•
Updated
Apr 25
•
2.66k
•
11
yczhuang/forms-filling-r1-distill
Viewer
•
Updated
Feb 24
•
49.1k
•
5