osunlp/UGround-V1-2B


BoyuNLP committed on Jan 3

Commit ed66472 · verified · 1 parent: f77f43f

Update README.md

Files changed (1)

README.md +21 -5

@@ -24,21 +24,35 @@ UGround is a strong GUI visual grounding model trained with a simple recipe. Che
 - [x] Model Weights
 - [ ] Code
   - [ ] Inference Code of UGround
   - [x] Offline Experiments
     - [x] Screenspot (along with referring expressions generated by GPT-4/4o)
     - [x] Multimodal-Mind2Web
     - [x] OmniAct
   - [ ] Online Experiments
-    - [ ] Mind2Web-Live
-    - [ ] AndroidWorld
-- [ ] Data
   - [ ] Data Examples
   - [ ] Data Construction Scripts
-  - [ ] Guidance of Open-source Data
 - [x] Online Demo (HF Spaces)
 ## Main Results
@@ -112,9 +126,11 @@ messages = format_openai_template(description, base64_image)
 completion = await client.chat.completions.create(
     model=args.model_path,
     messages=messages,
-    temperature=0
 )
 ```

 - [x] Model Weights
+- [x] Qwen2-VL-based V1
 - [ ] Code
   - [ ] Inference Code of UGround
   - [x] Offline Experiments
     - [x] Screenspot (along with referring expressions generated by GPT-4/4o)
     - [x] Multimodal-Mind2Web
     - [x] OmniAct
+    - [ ] Android Control
   - [ ] Online Experiments
+    - [ ] Mind2Web-Live-SeeAct-V
+    - [ ] AndroidWorld-SeeAct-V
+- [ ] Data-V1
   - [ ] Data Examples
   - [ ] Data Construction Scripts
+  - [ ] Guidance of Open-source Data
+- [ ] Data-V1.1
 - [x] Online Demo (HF Spaces)
+## Models
+Initial UGround-V1:
+UGround-V1-2B (Qwen2-VL): https://huggingface.co/osunlp/UGround-V1-2B
+UGround-V1-7B (Qwen2-VL): https://huggingface.co/osunlp/UGround-V1-7B
+UGround-V1-72B (Qwen2-VL): Coming Soon
+UGround-V1.1-2B (Qwen2-VL): Coming Soon
+UGround-V1.1-7B (Qwen2-VL): Coming Soon
+UGround-V1.1-72B (Qwen2-VL): Coming Soon
 ## Main Results
@@ -112,9 +126,11 @@ messages = format_openai_template(description, base64_image)
 completion = await client.chat.completions.create(
     model=args.model_path,
     messages=messages,
+    temperature=0  # Remember to set temperature to ZERO!
 )
+# The output will be in the range of [0,999), which is compatible with the original Qwen2-VL
 ```
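
The added comment says the output lies in a fixed range compatible with Qwen2-VL, so a caller has to rescale it to screen pixels before acting on it. The following is a minimal sketch of that step, not the UGround inference code. It assumes the model replies with a string such as `(x, y)` and that coordinates are normalized to a 1000-unit grid, as in Qwen2-VL. The helper name `to_pixels` and the `scale` parameter are hypothetical.

```python
import re


def to_pixels(output: str, width: int, height: int, scale: int = 1000) -> tuple[int, int]:
    """Map a normalized "(x, y)" model reply to pixel coordinates.

    Assumption: the reply contains two numbers on a `scale`-unit grid;
    adjust `scale` if the deployed model uses a different range.
    """
    # Pull the first two numbers out of the reply, tolerating extra text.
    nums = re.findall(r"-?\d+(?:\.\d+)?", output)
    if len(nums) < 2:
        raise ValueError(f"no coordinate pair found in: {output!r}")
    x, y = float(nums[0]), float(nums[1])
    # Rescale from the normalized grid to the screenshot's real size.
    return round(x / scale * width), round(y / scale * height)


if __name__ == "__main__":
    # Hypothetical reply for a 1920x1080 screenshot.
    print(to_pixels("(500, 250)", 1920, 1080))
```

Keeping `scale` as a parameter avoids hard-coding the range, which is useful if the stated bounds differ between model versions.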