A collection of papers on GUI agents
- ShowUI: One Vision-Language-Action Model for GUI Visual Agent
  Paper • 2411.17465 • Published • 88
- Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
  Paper • 2412.04454 • Published • 66
- AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials
  Paper • 2412.09605 • Published • 30