Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
zhibinlan 's Collections
LLaVE

LLaVE

updated Mar 10

LLaVE is a series of large language and vision embedding models trained on a variety of multimodal embedding datasets

Upvote
8

  • zhibinlan/LLaVE-0.5B

    Image-Text-to-Text • Updated Mar 14 • 2.93k • 7

  • zhibinlan/LLaVE-2B

    Image-Text-to-Text • Updated Mar 14 • 19.6k • 45

  • zhibinlan/LLaVE-7B

    Image-Text-to-Text • Updated Mar 14 • 1.61k • 5

  • LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning

    Paper • 2503.04812 • Published Mar 4 • 14
Upvote
8
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs