Liu Songhua's picture

6 43 4

Liu Songhua

Huage001

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

upvoted a paper 22 days ago

Image Editing As Programs with Diffusion Models

upvoted a paper 28 days ago

Shifting AI Efficiency From Model-Centric to Data-Centric Compression

View all activity

Organizations

None yet

upvoted a paper 4 days ago

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

Paper • 2506.16406 • Published 8 days ago • 105

upvoted a paper 22 days ago

Image Editing As Programs with Diffusion Models

Paper • 2506.04158 • Published 23 days ago • 24

upvoted 2 papers 28 days ago

Shifting AI Efficiency From Model-Centric to Data-Centric Compression

Paper • 2505.19147 • Published May 25 • 145

Alchemist: Turning Public Text-to-Image Data into Generative Gold

Paper • 2505.19297 • Published May 25 • 78

upvoted 3 papers about 1 month ago

Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression

Paper • 2505.19602 • Published May 26 • 13

Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps

Paper • 2505.18675 • Published May 24 • 23

VeriThinker: Learning to Verify Makes Reasoning Model Efficient

Paper • 2505.17941 • Published May 23 • 24

upvoted 2 papers 2 months ago

SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL

Paper • 2504.11455 • Published Apr 15 • 14

Efficient Reasoning Models: A Survey

Paper • 2504.10903 • Published Apr 15 • 18

upvoted 8 papers 3 months ago

Ultra-Resolution Adaptation with Ease

Paper • 2503.16322 • Published Mar 20 • 13

Scale-wise Distillation of Diffusion Models

Paper • 2503.16397 • Published Mar 20 • 42

Video-T1: Test-Time Scaling for Video Generation

Paper • 2503.18942 • Published Mar 24 • 89

AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset

Paper • 2503.19462 • Published Mar 25 • 10

Personalize Anything for Free with Diffusion Transformer

Paper • 2503.12590 • Published Mar 16 • 44

FlowTok: Flowing Seamlessly Across Text and Image Tokens

Paper • 2503.10772 • Published Mar 13 • 19

Kolmogorov-Arnold Attention: Is Learnable Attention Better For Vision Transformers?

Paper • 2503.10632 • Published Mar 13 • 14

PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity

Paper • 2503.07677 • Published Mar 10 • 86

upvoted 3 papers 4 months ago

Automated Movie Generation via Multi-Agent CoT Planning

Paper • 2503.07314 • Published Mar 10 • 45

ObjectMover: Generative Object Movement with Video Prior

Paper • 2503.08037 • Published Mar 11 • 4

One-step Diffusion Models with f-Divergence Distribution Matching

Paper • 2502.15681 • Published Feb 21 • 8