Improve model card with pipeline tag and library name #1
opened by nielsr (HF Staff)

README.md CHANGED
@@ -1,15 +1,17 @@
 ---
-license: mit
 base_model:
 - Qwen/Qwen2-VL-2B-Instruct
+license: mit
+library_name: transformers
+pipeline_tag: image-text-to-text
 ---
 
 # GUI-Actor-2B with Qwen2-VL-2B as backbone VLM
 
-This model was introduced in the paper [
+This model was introduced in the paper [GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents](https://www.arxiv.org/pdf/2506.03143).
 It is developed based on [Qwen2-VL-2B-Instruct ](https://huggingface.co/Qwen/Qwen2-VL-2B-Instruct), augmented by an attention-based action head and finetuned to perform GUI grounding using the dataset [here (coming soon)]().
 
-For more details on model design and evaluation, please check: [Project Page](https://
+For more details on model design and evaluation, please check: [Project Page](https://microsoft.github.io/GUI-Actor/) | [Github Repo](https://github.com/microsoft/GUI-Actor) | [Paper](https://www.arxiv.org/pdf/2506.03143).
 
 | Model Name | Hugging Face Link |
 |--------------------------------------------|--------------------------------------------|
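
The metadata change in this diff can be sanity-checked locally. The following is a minimal sketch, not part of the PR: `read_front_matter` is a hypothetical helper that assumes the front matter contains only flat `key: value` pairs and single-level `- item` lists, as in this model card. It is not a general YAML parser.

```python
def read_front_matter(card_text: str) -> dict:
    """Parse the simple front matter block at the top of a model card.

    Assumption: only flat `key: value` pairs and single-level `- item`
    lists appear, which matches the README shown in this diff.
    """
    lines = card_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}  # no front matter block at the top of the file

    meta = {}
    current_key = None
    for line in lines[1:]:
        if line.strip() == "---":
            break  # closing delimiter of the front matter
        if line.startswith("- ") and isinstance(meta.get(current_key), list):
            # list item belonging to the most recent key with an empty value
            meta[current_key].append(line[2:].strip())
        elif ":" in line:
            key, _, value = line.partition(":")
            current_key = key.strip()
            value = value.strip()
            # a key with no inline value starts a list
            meta[current_key] = value if value else []
    return meta


# Front matter as it reads after this PR is applied.
CARD = """---
base_model:
- Qwen/Qwen2-VL-2B-Instruct
license: mit
library_name: transformers
pipeline_tag: image-text-to-text
---

# GUI-Actor-2B with Qwen2-VL-2B as backbone VLM
"""

meta = read_front_matter(CARD)
missing = [k for k in ("library_name", "pipeline_tag") if k not in meta]
print("missing keys:", missing)
```

Running the same check against the pre-PR front matter would report both `library_name` and `pipeline_tag` as missing, which is the gap this PR closes.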