Zery
/

Safetensors
qwen2_vl
nielsr HF Staff commited on
Commit
abb5bd7
·
verified ·
1 Parent(s): 4fdd15f

Improve model card for SEAgent: Add metadata and description

Browse files

This PR significantly enhances the model card for SEAgent by:
- Adding the `pipeline_tag: image-text-to-text`, which helps users discover the model when filtering by task on the Hugging Face Hub.
- Specifying the `library_name: transformers` to indicate compatibility with the `transformers` library, enabling the "How to use" widget on the model page.
- Providing a concise yet informative description of the model's capabilities and purpose, derived from the paper's abstract.
- Updating the paper link from arXiv to the official Hugging Face papers page for better integration and user experience.

Files changed (1) hide show
  1. README.md +17 -6
README.md CHANGED
@@ -1,13 +1,24 @@
1
  ---
2
- license: apache-2.0
3
- datasets:
4
- - xlangai/ubuntu_osworld
5
  base_model:
6
  - ByteDance-Seed/UI-TARS-7B-DPO
 
 
 
 
 
7
  ---
8
 
9
- Checkout our github repo and arxiv paper for inference!
 
 
 
 
 
 
 
 
10
 
11
- https://github.com/SunzeY/SEAgent
 
12
 
13
- https://arxiv.org/abs/2508.04700
 
1
  ---
 
 
 
2
  base_model:
3
  - ByteDance-Seed/UI-TARS-7B-DPO
4
+ datasets:
5
+ - xlangai/ubuntu_osworld
6
+ license: apache-2.0
7
+ pipeline_tag: image-text-to-text
8
+ library_name: transformers
9
  ---
10
 
11
+ # SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience
12
+
13
+ This repository hosts `SEAgent`, an innovative agentic self-evolving framework designed to empower Large Vision-Language Models (LVLMs) to function as Computer Use Agents (CUAs). SEAgent enables these agents to autonomously master novel and specialized software environments, particularly in scenarios where human annotations are scarce.
14
+
15
+ The framework achieves this through experiential learning, allowing agents to explore new software, learn via iterative trial-and-error, and progressively tackle auto-generated tasks organized from simple to complex. Key innovations include a **World State Model** for step-wise trajectory assessment and a **Curriculum Generator** that generates increasingly diverse and challenging tasks. The agent's policy is updated through a novel experiential learning approach, combining adversarial imitation of failure actions and Group Relative Policy Optimization (GRPO) for successful ones. Furthermore, SEAgent introduces a specialist-to-generalist training strategy to develop a stronger generalist CUA capable of continuous autonomous evolution.
16
+
17
+ SEAgent demonstrates significant performance improvements, achieving a 23.2% increase in success rate on five novel software environments within OS-World, surpassing competitive open-source CUAs like UI-TARS.
18
+
19
+ For more detailed information, please refer to our paper and code:
20
 
21
+ * **Paper:** [SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience](https://huggingface.co/papers/2508.04700)
22
+ * **Code:** [https://github.com/SunzeY/SEAgent](https://github.com/SunzeY/SEAgent)
23
 
24
+ For inference and usage instructions, please consult the official GitHub repository.