ViSpeak: Visual Instruction Feedback in Streaming Videos Paper • 2503.12769 • Published 6 days ago • 8
ViSpeak: Visual Instruction Feedback in Streaming Videos Paper • 2503.12769 • Published 6 days ago • 8
Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition Paper • 2412.09501 • Published Dec 12, 2024 • 45
EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding Paper • 2406.08877 • Published Jun 13, 2024