100
Describe Anything
⚡
Describe parts of images using text prompts
Chat with an AI that understands text and images
Chat with an AI language model
Convert videos to BVH motion files
Segment objects in images using prompts
Engage in multi-modal conversations with images and videos
image captioning, VQA
VGGT (CVPR 2025)