Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation Paper • 2506.09350 • Published 4 days ago • 45
InterActHuman: Multi-Concept Human Animation with Layout-Aligned Audio Conditions Paper • 2506.09984 • Published 3 days ago • 12
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper • 2502.01061 • Published Feb 3 • 218 • 20
ViM: Vision Middleware for Unified Downstream Transferring Paper • 2303.06911 • Published Mar 13, 2023
Rethinking Supervised Pre-training for Better Downstream Transferring Paper • 2110.06014 • Published Oct 12, 2021
RLIPv2: Fast Scaling of Relational Language-Image Pre-training Paper • 2308.09351 • Published Aug 18, 2023
VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval Paper • 2211.12764 • Published Nov 23, 2022
Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency Paper • 2409.02634 • Published Sep 4, 2024 • 98
CyberHost: Taming Audio-driven Avatar Diffusion Model with Region Codebook Attention Paper • 2409.01876 • Published Sep 3, 2024 • 2
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper • 2502.01061 • Published Feb 3 • 218
FADA: Fast Diffusion Avatar Synthesis with Mixed-Supervised Multi-CFG Distillation Paper • 2412.16915 • Published Dec 22, 2024
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper • 2502.01061 • Published Feb 3 • 218
Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency Paper • 2409.02634 • Published Sep 4, 2024 • 98