InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding Paper • 2403.15377 • Published Mar 22, 2024 • 23
InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling Paper • 2501.12386 • Published about 1 month ago • 1