Jianzong Wu PRO

jianzongwu

https://jianzongwu.github.io

AI & ML interests

Multimodal Learning

Organizations

None yet

upvoted a paper 2 days ago

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published 4 days ago • 156

upvoted 2 papers about 2 months ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 255

Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models

Paper • 2508.00819 • Published Aug 1 • 62

upvoted a paper 2 months ago

ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World Shorts

Paper • 2507.20939 • Published Jul 28 • 56

upvoted 4 papers 3 months ago

Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models

Paper • 2507.07104 • Published Jul 9 • 45

Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology

Paper • 2507.07999 • Published Jul 10 • 48

Radial Attention: O(n log n) Sparse Attention with Energy Decay for Long Video Generation

Paper • 2506.19852 • Published Jun 24 • 41

VMoBA: Mixture-of-Block Attention for Video Diffusion Models

Paper • 2506.23858 • Published Jun 30 • 31

commented on a paper 3 months ago

VMoBA: Mixture-of-Block Attention for Video Diffusion Models

Paper • 2506.23858 • Published Jun 30 • 31

liked a dataset 3 months ago

OpenGVLab/OmniCorpus-CC-210M

Viewer • Updated Mar 20 • 208M • 559 • 31

upvoted a paper 5 months ago

On Path to Multimodal Generalist: General-Level and General-Bench

Paper • 2505.04620 • Published May 7 • 82

upvoted a paper 6 months ago

An Empirical Study of GPT-4o Image Generation Capabilities

Paper • 2504.05979 • Published Apr 8 • 63

authored a paper 6 months ago

An Empirical Study of GPT-4o Image Generation Capabilities

Paper • 2504.05979 • Published Apr 8 • 63

authored a paper 10 months ago

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published Dec 10, 2024 • 48

upvoted a paper 10 months ago

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published Dec 10, 2024 • 48

updated a dataset 10 months ago

jianzongwu/MangaZero

Viewer • Updated Dec 11, 2024 • 32.7k • 69 • 30

updated a model 10 months ago

jianzongwu/DiffSensei

Updated Dec 11, 2024 • 39

upvoted a paper 10 months ago

Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language

Paper • 2406.20085 • Published Jun 28, 2024 • 13

liked a Space 10 months ago

Meissonic Flow

🚀

Generate images from text descriptions

updated a dataset 10 months ago

jianzongwu/MotionBooth

Preview • Updated Nov 22, 2024 • 42
