xuetianci
xuetianci99
AI & ML interests: None yet
Recent Activity
upvoted a paper 9 days ago
Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge
updated a dataset 14 days ago
agent-evals/hal_traces
upvoted a paper about 2 months ago
An Illusion of Progress? Assessing the Current State of Web Agents