Can Large Multimodal Models Uncover Deep Semantics Behind Images? Paper • 2402.11281 • Published Feb 17, 2024
Beyond Single Frames: Can LMMs Comprehend Temporal and Contextual Narratives in Image Sequences? Paper • 2502.13925 • Published Feb 19