Vision Language General Vision Language General MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks Paper • 2410.10563 • Published Oct 14 • 37 Latent Action Pretraining from Videos Paper • 2410.11758 • Published Oct 15 • 2 TVBench: Redesigning Video-Language Evaluation Paper • 2410.07752 • Published Oct 10 • 5
MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks Paper • 2410.10563 • Published Oct 14 • 37