-
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction
Paper • 2404.02905 • Published • 74 -
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation
Paper • 2404.02733 • Published • 23 -
Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models
Paper • 2404.02747 • Published • 13 -
Bigger is not Always Better: Scaling Properties of Latent Diffusion Models
Paper • 2404.01367 • Published • 23
Chenxin Li
XGGNet
AI & ML interests
None yet
Recent Activity
liked
a model
6 days ago
Qwen/Qwen-Image
upvoted
a
paper
about 1 month ago
IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as
Agentic Inverse Rendering
upvoted
a
paper
about 2 months ago
JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo
Retouching Agent
Organizations
None yet