- Fashion-VDM: Video Diffusion Model for Virtual Try-On
  Paper • 2411.00225 • Published • 11
- HelloMeme: Integrating Spatial Knitting Attentions to Embed High-Level and Fidelity-Rich Conditions in Diffusion Models
  Paper • 2410.22901 • Published • 8
- Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations
  Paper • 2506.18898 • Published • 33
Zhongwei Zhang
zzwustc
AI & ML interests
AIGC
Recent Activity
liked a dataset 18 days ago
UCSC-VLAA/GPT-Image-Edit-1.5M
upvoted an article 22 days ago
You could have designed state of the art positional encoding
liked a model about 1 month ago
ByteDance/Sa2VA-4B