LuxDiT: Lighting Estimation with Video Diffusion Transformer Paper • 2509.03680 • Published 5 days ago • 2
Few-step Flow for 3D Generation via Marginal-Data Transport Distillation Paper • 2509.04406 • Published 4 days ago • 10
Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation Paper • 2509.00428 • Published 9 days ago • 14
FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games Paper • 2509.01052 • Published 7 days ago • 19
GenCompositor: Generative Video Compositing with Diffusion Transformer Paper • 2509.02460 • Published 6 days ago • 22
Droplet3D: Commonsense Priors from Videos Facilitate 3D Generation Paper • 2508.20470 • Published 11 days ago • 64
Collaborative Multi-Modal Coding for High-Quality 3D Generation Paper • 2508.15228 • Published 18 days ago • 4
Dress&Dance: Dress up and Dance as You Like It - Technical Preview Paper • 2508.21070 • Published 11 days ago • 5
OneReward: Unified Mask-Guided Image Generation via Multi-Task Human Preference Learning Paper • 2508.21066 • Published 11 days ago • 12
USO: Unified Style and Subject-Driven Generation via Disentangled and Reward Learning Paper • 2508.18966 • Published 13 days ago • 56
Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning Paper • 2508.20751 • Published 11 days ago • 85
MotionFlux: Efficient Text-Guided Motion Generation through Rectified Flow Matching and Preference Alignment Paper • 2508.19527 • Published 12 days ago • 9
Diffusion Language Models Know the Answer Before Decoding Paper • 2508.19982 • Published 12 days ago • 22
MIDAS: Multimodal Interactive Digital-human Synthesis via Real-time Autoregressive Video Generation Paper • 2508.19320 • Published 13 days ago • 27
ObjFiller-3D: Consistent Multi-view 3D Inpainting via Video Diffusion Models Paper • 2508.18271 • Published 14 days ago • 7