InfiGUI: Advanced Vision-Native Agent for GUI Interaction

updated 5 days ago

  • InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners

    Paper • 2504.14239 • Published Apr 19 • 13

  • InfiX-ai/InfiGUI-R1-3B

    Image-Text-to-Text • 4B • Updated Apr 26 • 1.3k • 3

  • InfiX-ai/android_control_train

    Viewer • Updated Jun 8 • 13.6k • 35

  • InfiX-ai/android_control_test

    Updated May 21 • 37

  • InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection

    Paper • 2501.04575 • Published Jan 8 • 24

  • InfiX-ai/InfiGUIAgent-2B-Stage1

    Image-Text-to-Text • 2B • Updated Feb 6 • 26 • 2

  • InfiX-ai/InfiGUIAgent-Data

    Viewer • Updated Jan 23 • 2.89k • 53 • 3