How to perform object detection?

by Godsing - opened 5 days ago

Discussion

Godsing

5 days ago

•

edited 5 days ago

The Model card mentions:

Qwen2.5-VL can accurately localize objects in an image by generating bounding boxes or points, and it can provide stable JSON outputs for coordinates and attributes.

I can't find any usage examples. And when I try to write prompts by myself, it fails to obtain the expected result..
So, is there an usage example?

Godsing

5 days ago

I think I have found it: https://github.com/QwenLM/Qwen2.5-VL/blob/main/cookbooks/spatial_understanding.ipynb

Godsing changed discussion status to closed 5 days ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Your need to confirm your account before you can post a new comment.

· Sign up or log in to comment