VLM2Vec

community
https://github.com/TIGER-AI-Lab/VLM2Vec

AI & ML interests

Multimodal Embeddings and Retrieval.

Recent Activity

ziyjiang  updated a model 4 days ago
VLM2Vec/VLM2Vec-V2.0
MINGYISU  authored a paper 6 days ago
Breaking the Batch Barrier (B3) of Contrastive Learning via Smart Batch Mining
MINGYISU  authored a paper 6 days ago
VLM2Vec-V2: Advancing Multimodal Embedding for Videos, Images, and Visual Documents

Team members: Xuan "Billy" Zhang, Rui, Ziyan Jiang, Xinyi Yang, Liu, MINGYI SU

models 1

VLM2Vec/VLM2Vec-V2.0

Image-to-Text • Updated 4 days ago • 2.43k • 7

datasets 21

VLM2Vec/MomentSeeker

Viewer • Updated 21 days ago • 1.8k • 157

VLM2Vec/Charades-STA

Viewer • Updated 21 days ago • 727 • 127

VLM2Vec/QVHighlight

Viewer • Updated 21 days ago • 1.08k • 576

VLM2Vec/MMEB-V2

Updated Jun 13 • 268

VLM2Vec/Kinetics-700

Viewer • Updated May 31 • 1k • 489

VLM2Vec/ViDoRe_esg_reports_human_labeled_v2

Viewer • Updated May 31 • 1.72k • 11

VLM2Vec/ViDoRe_economics_reports_v2

Viewer • Updated May 30 • 1.42k • 10

VLM2Vec/ViDoRe_biomedical_lectures_v2_multilingual

Viewer • Updated May 30 • 3.74k • 13

VLM2Vec/ViDoRe_biomedical_lectures_v2

Viewer • Updated May 30 • 1.72k • 20

VLM2Vec/ViDoRe_esg_reports_v2_multilingual

Viewer • Updated May 30 • 2.68k • 9