VLM2Vec

community

https://github.com/TIGER-AI-Lab/VLM2Vec

AI & ML interests

Multimodal Embeddings and Retrieval.

Recent Activity

ziyjiang updated a Space 4 days ago

Lux1997 updated a dataset 20 days ago

VLM2Vec/MMEB-V3

memray authored a paper about 1 month ago

XGen-7B Technical Report

View all activity

Organization Card

Community About org cards

VLM2Vec & MMEB: Benchmarking multimodal embeddings and adapting state-of-the-art multimodal large language models into embedding models.

Website - https://tiger-ai-lab.github.io/VLM2Vec/
Github https://github.com/TIGER-AI-Lab/VLM2Vec

List of Our Papers

Main VLM2Vec / MMEB Series

VLM2Vec / MMEB – Image embedding benchmarking and models. (ICLR2025)
VLM2Vec-V2 / MMEB-V2 – Extension of our previous work to video and visual document tasks. (TMLR2026)
MMEB-V3 - Extension of our previous work to Omni-modality. (COLM2026)

Other Related Papers from Our Team

GAE-Retriever – Benchmark and model for trajectory modeling in GUI environments. (Computer-use Agents@ICML 2025)
B3 – A novel batch mining strategy for contrastive learning. (Neurips2025)

spaces 2

MMEB Leaderboard

The massive multimodal embedding benchmark

models 1

VLM2Vec/VLM2Vec-V2.0

Image-Text-to-Text • Updated Jul 13, 2025 • 11.8k • 29

datasets 45

VLM2Vec/MMEB-V3

Viewer • Updated 20 days ago • 36k • 968 • 2

VLM2Vec/GAE-Mind2Web

Viewer • Updated Feb 11 • 12.1k • 55

VLM2Vec/GAE-GUIAct

Viewer • Updated Feb 11 • 74.3k • 14

VLM2Vec/Video_Caption_HN

Viewer • Updated Dec 20, 2025 • 302k • 40

VLM2Vec/MMLongBench-page-fixed

Viewer • Updated Nov 4, 2025 • 8.91k • 2.91k

VLM2Vec/ViDoSeek-page-fixed

Viewer • Updated Nov 4, 2025 • 8.78k • 616

VLM2Vec/MMEB-V2

Updated Sep 24, 2025 • 189 • 2

VLM2Vec/B3-7b

Viewer • Updated Aug 29, 2025 • 1.03M • 16 • 1

VLM2Vec/B3-2b

Viewer • Updated Aug 29, 2025 • 1.03M • 18

VLM2Vec/MVBench

Viewer • Updated Aug 15, 2025 • 4k • 1.17k

View 45 datasets