Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
TIGER-Lab 's Collections
MoCha
General-Reasoner
VL-Rethinker
Vamba
TheoremExplain
ABC
VisualWebInstruct
PixelWorld
AceCoder
CritiqueFineTuning
MAmmoTH-VL
ScholarCopilot
VISTA
OmniEdit
MEGA-Bench
VLM2Vec
TIGERScore
MAmmoTH
UniIR
ImagenHub
Science
StructLM
ConsistI2V
Mantis
MAmmoTH2
VideoScore
Long-Context

VISTA

updated Dec 20, 2024

Video Augmentation for Synthetic Video Instruction-following Data Generation

Upvote
-

  • TIGER-Lab/VISTA-LongVA

    Video-Text-to-Text • Updated Mar 14 • 34 • 2

  • TIGER-Lab/VISTA-Mantis

    Video-Text-to-Text • Updated Mar 14 • 3

  • TIGER-Lab/VISTA-VideoLLaVA

    Video-Text-to-Text • Updated Mar 14 • 3

  • VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation

    Paper • 2412.00927 • Published Dec 1, 2024 • 28

  • TIGER-Lab/VISTA-400K

    Viewer • Updated Dec 19, 2024 • 381k • 593 • 2

  • TIGER-Lab/HRVideoBench

    Viewer • Updated Dec 20, 2024 • 200 • 108 • 1
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs