Boyuan Zheng

boyuanzheng010

https://boyuanzheng010.github.io/

AI & ML interests

Language Agents, Multilinguality

Recent Activity

upvoted a paper 5 days ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

updated a dataset about 1 month ago

osunlp/WebGuard

published a dataset about 1 month ago

osunlp/WebGuard

upvoted a paper 5 days ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published 9 days ago • 76

upvoted a paper 2 months ago

Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

Paper • 2506.21506 • Published Jun 26 • 51

upvoted 2 papers 5 months ago

AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories

Paper • 2504.08942 • Published Apr 11 • 27

SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills

Paper • 2504.07079 • Published Apr 9 • 11

upvoted an article 6 months ago

Open R1: Update #3

Article • Mar 11 • 295

upvoted a paper 10 months ago

Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents

Paper • 2411.06559 • Published Nov 10, 2024 • 15

upvoted a paper 11 months ago

Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents

Paper • 2410.05243 • Published Oct 7, 2024 • 19

upvoted 2 papers over 1 year ago

MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions

Paper • 2403.19651 • Published Mar 28, 2024 • 23

GPT-4V(ision) is a Generalist Web Agent, if Grounded

Paper • 2401.01614 • Published Jan 3, 2024 • 23

upvoted a paper almost 2 years ago

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

Paper • 2311.16502 • Published Nov 27, 2023 • 36
