Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
lmms-lab 's Collections
Aero-1-Audio
EgoLife
VideoMMMU
Multimodal-SAE
LLaVA-Critic
LLaVA-Video
LLaVA-OneVision
LMMs-Eval
LongVA
LLaVA-Next-Interleave
LLaVA-NeXT
LMMs-Eval-Lite

LLaVA-Video

updated Feb 21

Models focus on video understanding (previously known as LLaVA-NeXT-Video).

Upvote
61

  • Video Instruction Tuning With Synthetic Data

    Paper • 2410.02713 • Published Oct 3, 2024 • 39

  • lmms-lab/LLaVA-Video-178K

    Viewer • Updated Oct 11, 2024 • 1.63M • 10.2k • 142

  • lmms-lab/LLaVA-Video-7B-Qwen2

    Video-Text-to-Text • Updated Oct 25, 2024 • 50.4k • 93

  • lmms-lab/LLaVA-Video-72B-Qwen2

    Text Generation • Updated Oct 25, 2024 • 1.05k • 19

  • lmms-lab/LLaVA-Video-7B-Qwen2-Video-Only

    Text Generation • Updated Oct 4, 2024 • 489 • 4

  • lmms-lab/LLaVA-NeXT-Video-7B-DPO

    Video-Text-to-Text • Updated Feb 21 • 1.39k • 27

  • lmms-lab/LLaVA-NeXT-Video-7B

    Video-Text-to-Text • Updated Feb 21 • 645 • 47
Upvote
61
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs