VLM4D: Towards Spatiotemporal Awareness in Vision Language Models Paper • 2508.02095 • Published 8 days ago • 5
VLM4D: Towards Spatiotemporal Awareness in Vision Language Models Paper • 2508.02095 • Published 8 days ago • 5 • 2
Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields Paper • 2503.20776 • Published Mar 26 • 8
Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields Paper • 2503.20776 • Published Mar 26 • 8