wangshuai's picture

Open to Collab

wangshuai

wangsssssss

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 17 days ago

SemanticGen: Video Generation in Semantic Space

upvoted a paper 20 days ago

Bidirectional Normalizing Flow: From Data to Noise and Back

upvoted a paper 25 days ago

Towards Scalable Pre-training of Visual Tokenizers for Generation

View all activity

Organizations

upvoted a paper 17 days ago

SemanticGen: Video Generation in Semantic Space

Paper • 2512.20619 • Published 18 days ago • 90

upvoted a paper 20 days ago

Bidirectional Normalizing Flow: From Data to Noise and Back

Paper • 2512.10953 • Published 30 days ago • 5

upvoted a paper 25 days ago

Towards Scalable Pre-training of Visual Tokenizers for Generation

Paper • 2512.13687 • Published 26 days ago • 100

upvoted a paper 26 days ago

SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder

Paper • 2512.11749 • Published 29 days ago • 38

upvoted 12 papers about 1 month ago

TreeGRPO: Tree-Advantage GRPO for Online RL Post-Training of Diffusion Models

Paper • 2512.08153 • Published Dec 9, 2025 • 7

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Paper • 2512.08765 • Published Dec 9, 2025 • 128

LongCat-Image Technical Report

Paper • 2512.07584 • Published Dec 8, 2025 • 18

TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows

Paper • 2512.05150 • Published Dec 3, 2025 • 74

Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion

Paper • 2512.04926 • Published Dec 4, 2025 • 41

PixelDiT: Pixel Diffusion Transformers for Image Generation

Paper • 2511.20645 • Published Nov 25, 2025 • 30

RELIC: Interactive Video World Model with Long-Horizon Memory

Paper • 2512.04040 • Published Dec 3, 2025 • 23

Generating an Image From 1,000 Words: Enhancing Text-to-Image With Structured Captions

Paper • 2511.06876 • Published Nov 10, 2025 • 27

Adversarial Flow Models

Paper • 2511.22475 • Published Nov 27, 2025 • 22

Vision Bridge Transformer at Scale

Paper • 2511.23199 • Published Nov 28, 2025 • 45

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 248

Flowing Backwards: Improving Normalizing Flows via Reverse Representation Alignment

Paper • 2511.22345 • Published Nov 27, 2025 • 12

upvoted 4 papers about 2 months ago

DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation

Paper • 2511.19365 • Published Nov 24, 2025 • 64

Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks

Paper • 2511.15065 • Published Nov 19, 2025 • 75

Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

Paper • 2511.14993 • Published Nov 19, 2025 • 227

Back to Basics: Let Denoising Generative Models Denoise

Paper • 2511.13720 • Published Nov 17, 2025 • 67