Zery
/

Safetensors
qwen2_vl

Improve model card for SEAgent: Add metadata and description

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +17 -6
README.md CHANGED
@@ -1,13 +1,24 @@
1
  ---
2
- license: apache-2.0
3
- datasets:
4
- - xlangai/ubuntu_osworld
5
  base_model:
6
  - ByteDance-Seed/UI-TARS-7B-DPO
 
 
 
 
 
7
  ---
8
 
9
- Checkout our github repo and arxiv paper for inference!
 
 
 
 
 
 
 
 
10
 
11
- https://github.com/SunzeY/SEAgent
 
12
 
13
- https://arxiv.org/abs/2508.04700
 
1
  ---
 
 
 
2
  base_model:
3
  - ByteDance-Seed/UI-TARS-7B-DPO
4
+ datasets:
5
+ - xlangai/ubuntu_osworld
6
+ license: apache-2.0
7
+ pipeline_tag: image-text-to-text
8
+ library_name: transformers
9
  ---
10
 
11
+ # SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience
12
+
13
+ This repository hosts `SEAgent`, an innovative agentic self-evolving framework designed to empower Large Vision-Language Models (LVLMs) to function as Computer Use Agents (CUAs). SEAgent enables these agents to autonomously master novel and specialized software environments, particularly in scenarios where human annotations are scarce.
14
+
15
+ The framework achieves this through experiential learning, allowing agents to explore new software, learn via iterative trial-and-error, and progressively tackle auto-generated tasks organized from simple to complex. Key innovations include a **World State Model** for step-wise trajectory assessment and a **Curriculum Generator** that generates increasingly diverse and challenging tasks. The agent's policy is updated through a novel experiential learning approach, combining adversarial imitation of failure actions and Group Relative Policy Optimization (GRPO) for successful ones. Furthermore, SEAgent introduces a specialist-to-generalist training strategy to develop a stronger generalist CUA capable of continuous autonomous evolution.
16
+
17
+ SEAgent demonstrates significant performance improvements, achieving a 23.2% increase in success rate on five novel software environments within OS-World, surpassing competitive open-source CUAs like UI-TARS.
18
+
19
+ For more detailed information, please refer to our paper and code:
20
 
21
+ * **Paper:** [SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience](https://huggingface.co/papers/2508.04700)
22
+ * **Code:** [https://github.com/SunzeY/SEAgent](https://github.com/SunzeY/SEAgent)
23
 
24
+ For inference and usage instructions, please consult the official GitHub repository.