Whole-Body Conditioned Egocentric Video Prediction Paper • 2506.21552 • Published 16 days ago • 11
AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information? Paper • 2412.02611 • Published Dec 3, 2024 • 24
Evaluating Multiview Object Consistency in Humans and Image Models Paper • 2409.05862 • Published Sep 9, 2024 • 11