Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Peter Shaw
PeterShaw
Follow
AI & ML interests
None yet
Recent Activity
authored
a paper
about 19 hours ago
AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories
View all activity
Organizations
None yet
Papers
1
arxiv:
2504.08942
models
None public yet
datasets
None public yet