MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions Paper • 2112.00431 • Published Dec 1, 2021
OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection Paper • 2502.20361 • Published Feb 27 • 1
MatchDiffusion: Training-free Generation of Match-cuts Paper • 2411.18677 • Published Nov 27, 2024 • 1
UnMix-NeRF: Spectral Unmixing Meets Neural Radiance Fields Paper • 2506.21884 • Published Jun 27 • 12
Motion-Aware Concept Alignment for Consistent Video Editing Paper • 2506.01004 • Published Jun 1 • 7
OmniResponse: Online Multimodal Conversational Response Generation in Dyadic Interactions Paper • 2505.21724 • Published May 27 • 4
An Embarrassingly Simple Defense Against LLM Abliteration Attacks Paper • 2505.19056 • Published May 25 • 5
MOLE: Metadata Extraction and Validation in Scientific Papers Using LLMs Paper • 2505.19800 • Published May 26 • 2
Masader: Metadata Sourcing for Arabic Text and Speech Data Resources Paper • 2110.06744 • Published Oct 13, 2021