nielsr HF Staff commited on
Commit
574aace
Β·
verified Β·
1 Parent(s): 4770c51

Improve model card with pipeline tag and library name

Browse files

This PR improves the model card by:

- Adding the `pipeline_tag: image-text-to-text` to better categorize the model on the Hugging Face Hub. This allows users searching for image-to-text models to easily find this one.
- Specifying the `library_name: transformers`, clarifying that the model utilizes the Transformers library.

This enhances discoverability and provides crucial context for users.

Files changed (1) hide show
  1. README.md +5 -3
README.md CHANGED
@@ -1,15 +1,17 @@
1
  ---
2
- license: mit
3
  base_model:
4
  - Qwen/Qwen2-VL-2B-Instruct
 
 
 
5
  ---
6
 
7
  # GUI-Actor-2B with Qwen2-VL-2B as backbone VLM
8
 
9
- This model was introduced in the paper [**GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents**](https://aka.ms/GUI-Actor).
10
  It is developed based on [Qwen2-VL-2B-Instruct ](https://huggingface.co/Qwen/Qwen2-VL-2B-Instruct), augmented by an attention-based action head and finetuned to perform GUI grounding using the dataset [here (coming soon)]().
11
 
12
- For more details on model design and evaluation, please check: [🏠 Project Page](https://aka.ms/GUI-Actor) | [πŸ’» Github Repo](https://github.com/microsoft/GUI-Actor) | [πŸ“‘ Paper](https://www.arxiv.org/pdf/2506.03143).
13
 
14
  | Model Name | Hugging Face Link |
15
  |--------------------------------------------|--------------------------------------------|
 
1
  ---
 
2
  base_model:
3
  - Qwen/Qwen2-VL-2B-Instruct
4
+ license: mit
5
+ library_name: transformers
6
+ pipeline_tag: image-text-to-text
7
  ---
8
 
9
  # GUI-Actor-2B with Qwen2-VL-2B as backbone VLM
10
 
11
+ This model was introduced in the paper [GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents](https://www.arxiv.org/pdf/2506.03143).
12
  It is developed based on [Qwen2-VL-2B-Instruct ](https://huggingface.co/Qwen/Qwen2-VL-2B-Instruct), augmented by an attention-based action head and finetuned to perform GUI grounding using the dataset [here (coming soon)]().
13
 
14
+ For more details on model design and evaluation, please check: [🏠 Project Page](https://microsoft.github.io/GUI-Actor/) | [πŸ’» Github Repo](https://github.com/microsoft/GUI-Actor) | [πŸ“‘ Paper](https://www.arxiv.org/pdf/2506.03143).
15
 
16
  | Model Name | Hugging Face Link |
17
  |--------------------------------------------|--------------------------------------------|