tasal9's Collections
ZamAI Multimodal (Image + Text)

updated Jun 29

  • llava-hf/llava-1.5-7b-hf

    Image-Text-to-Text • 7B params • Updated Jun 6 • 1.38M downloads • 325 likes

    Note: Visual question answering.


  • Salesforce/blip-image-captioning-base

    Image-to-Text • Updated Feb 3 • 2.3M downloads • 822 likes

    Note: Image captioning for UIs or accessibility.


  • openai/clip-vit-large-patch14

    Zero-Shot Image Classification • 0.4B params • Updated Sep 15, 2023 • 8.46M downloads • 1.92k likes

    Note: Image embeddings, well suited to search.
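
    The note above mentions search over image embeddings. As a rough illustration of what that could look like once embeddings exist, here is a minimal sketch in plain Python that ranks items by cosine similarity. The function names (`cosine_similarity`, `search`), the file names, and the 3-dimensional toy vectors are all hypothetical placeholders; real vectors would come from an embedding model such as the one listed above and would be much longer.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    # A zero vector has no direction, so treat its similarity as 0.
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def search(query_embedding, index, top_k=3):
    """Rank the items in `index` (name -> vector) against the query, best first."""
    scored = [
        (name, cosine_similarity(query_embedding, vector))
        for name, vector in index.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Toy vectors standing in for real image embeddings (hypothetical data).
toy_index = {
    "cat.jpg": [0.9, 0.1, 0.0],
    "dog.jpg": [0.1, 0.9, 0.0],
    "car.jpg": [0.0, 0.1, 0.9],
}

# A query vector that points mostly along the first axis.
results = search([1.0, 0.0, 0.0], toy_index, top_k=2)
for name, score in results:
    print(f"{name}: {score:.3f}")
```

    A linear scan like this is only reasonable for small collections; larger ones would typically use a vector index instead, which is outside the scope of this sketch.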
