VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs Paper • 2406.07476 • Published Jun 11, 2024 • 38
DAMO-NLP-SG/VideoLLaMA2.1-7B-16F Visual Question Answering • 8B • Updated Oct 21, 2024 • 3.39k • 10
DAMO-NLP-SG/VideoLLaMA2.1-7B-AV Visual Question Answering • 9B • Updated Oct 25, 2024 • 1.06k • 14
DAMO-NLP-SG/VideoLLaMA2-7B Visual Question Answering • 8B • Updated Aug 13, 2024 • 2.79k • 41
DAMO-NLP-SG/VideoLLaMA2-7B-16F Visual Question Answering • 8B • Updated Aug 13, 2024 • 355 • 15
DAMO-NLP-SG/VideoLLaMA2.1-7B-16F-Base Visual Question Answering • Updated Oct 21, 2024 • 53 • 1