PoSynDA: Multi-Hypothesis Pose Synthesis Domain Adaptation for Robust 3D Human Pose Estimation Paper • 2308.09678 • Published Aug 18, 2023
Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models Paper • 2410.19635 • Published Oct 25, 2024
LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models Paper • 2501.18954 • Published Jan 31
ViSpeak: Visual Instruction Feedback in Streaming Videos Paper • 2503.12769 • Published 6 days ago • 8