Negative-Guided Subject Fidelity Optimization for Zero-Shot Subject-Driven Generation Paper • 2506.03621 • Published Jun 4 • 22
Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage Paper • 2412.15484 • Published Dec 20, 2024 • 15
Style-Friendly SNR Sampler for Style-Driven Generation Paper • 2411.14793 • Published Nov 22, 2024 • 40 • 5
Unified Speech-Text Pretraining for Spoken Dialog Modeling Paper • 2402.05706 • Published Feb 8, 2024 • 6
VoiceGuider: Enhancing Out-of-Domain Performance in Parameter-Efficient Speaker-Adaptive Text-to-Speech via Autoguidance Paper • 2409.15759 • Published Sep 24, 2024 • 1
NanoVoice: Efficient Speaker-Adaptive Text-to-Speech for Multiple Speakers Paper • 2409.15760 • Published Sep 24, 2024 • 1
CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models Paper • 2411.18613 • Published Nov 27, 2024 • 58
Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator Paper • 2411.15466 • Published Nov 23, 2024 • 39
Disentangled Motion Modeling for Video Frame Interpolation Paper • 2406.17256 • Published Jun 25, 2024
Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation Paper • 2403.10911 • Published Mar 16, 2024