MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions Paper • 2112.00431 • Published Dec 1, 2021
OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection Paper • 2502.20361 • Published Feb 27, 2025 • 1
MatchDiffusion: Training-free Generation of Match-cuts Paper • 2411.18677 • Published Nov 27, 2024 • 1