Many VLMs claim to process hours of video. But can they follow the story?π€ Today, we introduce TimeScope: The benchmark that separates true temporal understanding from marketing hype. Let's see how much VLMs really understand!β³
The results are in, and they're revealing. Only Gemini 2.5 pro handles 1-hour-long videos. Performance drops sharply with duration, proving that long video understanding is still challenging. We've found the breaking pointsβnow the community can start fixing them.π
Want to learn more? TimeScope is 100% open-source. Benchmark your model and help us build the next generation of video AI.