DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion Paper • 2411.04928 • Published 10 days ago • 43
GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details Paper • 2411.03047 • Published 12 days ago • 6
VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos Paper • 2411.04923 • Published 10 days ago • 20
ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning Paper • 2411.05003 • Published 10 days ago • 65
InstantIR: Blind Image Restoration with Instant Generative Reference Paper • 2410.06551 • Published Oct 9 • 6
SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation Paper • 2410.23277 • Published 18 days ago • 7
AutoVFX: Physically Realistic Video Editing from Natural Language Instructions Paper • 2411.02394 • Published 13 days ago • 15
Adaptive Caching for Faster Video Generation with Diffusion Transformers Paper • 2411.02397 • Published 13 days ago • 20
DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation Paper • 2410.13726 • Published about 1 month ago • 10
AutoTrain: No-code training for state-of-the-art models Paper • 2410.15735 • Published 27 days ago • 56
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree Paper • 2410.16268 • Published 27 days ago • 65
MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark Paper • 2410.19168 • Published 24 days ago • 19
Advanced Flux Dreambooth LoRA Training with 🧨 diffusers Article • By linoyts • 27 days ago • 27
Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations Paper • 2410.10792 • Published Oct 14 • 26
MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion Paper • 2410.03825 • Published Oct 4 • 17
T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through Data, Reward, and Conditional Guidance Design Paper • 2410.05677 • Published Oct 8 • 14