VideoUFO: A Million-Scale User-Focused Dataset for Text-to-Video Generation Paper • 2503.01739 • Published Mar 3 • 8
Generalizable Origin Identification for Text-Guided Image-to-Image Diffusion Models Paper • 2501.02376 • Published Jan 4 • 3
TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation Paper • 2411.04709 • Published Nov 5, 2024 • 27
Replication in Visual Diffusion Models: A Survey and Outlook Paper • 2408.00001 • Published Jul 7, 2024
MonoFormer: One Transformer for Both Diffusion and Autoregression Paper • 2409.16280 • Published Sep 24, 2024 • 18
VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models Paper • 2403.06098 • Published Mar 10, 2024 • 17
Attentive WaveBlock: Complementarity-enhanced Mutual Networks for Unsupervised Domain Adaptation in Person Re-identification and Beyond Paper • 2006.06525 • Published Jun 11, 2020
Learning Anchored Unsigned Distance Functions with Gradient Direction Alignment for Single-view Garment Reconstruction Paper • 2108.08478 • Published Aug 19, 2021
DomainMix: Learning Generalizable Person Re-Identification Without Human Annotations Paper • 2011.11953 • Published Nov 24, 2020
Bag of Tricks and A Strong baseline for Image Copy Detection Paper • 2111.08004 • Published Nov 13, 2021
D$^2$LV: A Data-Driven and Local-Verification Approach for Image Copy Detection Paper • 2111.07090 • Published Nov 13, 2021
A Benchmark and Asymmetrical-Similarity Learning for Practical Image Copy Detection Paper • 2205.12358 • Published May 24, 2022
TransHP: Image Classification with Hierarchical Prompting Paper • 2304.06385 • Published Apr 13, 2023
Results and findings of the 2021 Image Similarity Challenge Paper • 2202.04007 • Published Feb 8, 2022
V$^2$L: Leveraging Vision and Vision-language Models into Large-scale Product Retrieval Paper • 2207.12994 • Published Jul 26, 2022
Feature-compatible Progressive Learning for Video Copy Detection Paper • 2304.10305 • Published Apr 20, 2023