LLaVA-Video - a lmms-lab Collection

LLaVA-Video

Updated Feb 21

Models focused on video understanding (previously known as LLaVA-NeXT-Video).

Video Instruction Tuning With Synthetic Data
Paper • 2410.02713 • Published Oct 3, 2024 • 39

lmms-lab/LLaVA-Video-178K
Viewer • Updated Oct 11, 2024 • 1.63M • 25.2k • 152

lmms-lab/LLaVA-Video-7B-Qwen2
Video-Text-to-Text • 8B • Updated Oct 25, 2024 • 134k • 102

lmms-lab/LLaVA-Video-72B-Qwen2
Text Generation • 73B • Updated Oct 25, 2024 • 375 • 20

lmms-lab/LLaVA-Video-7B-Qwen2-Video-Only
Text Generation • 8B • Updated Oct 4, 2024 • 736 • 4

lmms-lab/LLaVA-NeXT-Video-32B-Qwen
Video-Text-to-Text • 33B • Updated Oct 4, 2024 • 216 • 15

lmms-lab/LLaVA-NeXT-Video-7B-DPO
Video-Text-to-Text • 7B • Updated Feb 21 • 5.48k • 28

lmms-lab/LLaVA-NeXT-Video-7B
Video-Text-to-Text • 7B • Updated Feb 21 • 994 • 48