Sirui Zhang

zsr200901

AI & ML interests

None yet

Recent Activity

liked a dataset 9 days ago

k-mktr/improved-flux-prompts-photoreal-portrait

liked a dataset 9 days ago

conorcl/portraits-512

upvoted a paper 23 days ago

Distribution Matching Variational AutoEncoder

View all activity

Organizations

liked 2 datasets 9 days ago

k-mktr/improved-flux-prompts-photoreal-portrait

Viewer • Updated Oct 3, 2024 • 20k • 745 • 115

conorcl/portraits-512

Viewer • Updated Dec 30, 2022 • 2.92k • 25 • 7

upvoted a paper 23 days ago

Distribution Matching Variational AutoEncoder

Paper • 2512.07778 • Published 26 days ago • 28

upvoted a paper 29 days ago

Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion

Paper • 2512.04926 • Published about 1 month ago • 41

upvoted a paper about 1 month ago

Back to Basics: Let Denoising Generative Models Denoise

Paper • 2511.13720 • Published Nov 17, 2025 • 67

upvoted a paper about 2 months ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6, 2025 • 211

upvoted 2 papers 2 months ago

DeepSeek-OCR: Contexts Optical Compression

Paper • 2510.18234 • Published Oct 21, 2025 • 86

RL makes MLLMs see better than SFT

Paper • 2510.16333 • Published Oct 18, 2025 • 48

updated a collection 4 months ago

VLA

Collection

2 items • Updated Sep 15, 2025

upvoted a paper 4 months ago

R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning

Paper • 2508.21113 • Published Aug 28, 2025 • 110

upvoted 4 papers 5 months ago

The Promise of RL for Autoregressive Image Editing

Paper • 2508.01119 • Published Aug 1, 2025 • 11

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Paper • 2508.01191 • Published Aug 2, 2025 • 238

EDGE-GRPO: Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity

Paper • 2507.21848 • Published Jul 29, 2025 • 8

Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis

Paper • 2507.23785 • Published Jul 31, 2025 • 18

upvoted a collection 5 months ago

ReasonGen-R1

Collection

Model and Datasets for the paper "ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RL • 7 items • Updated Jun 2, 2025 • 6

upvoted 3 papers 5 months ago

X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again

Paper • 2507.22058 • Published Jul 29, 2025 • 39

Phi-Ground Tech Report: Advancing Perception in GUI Grounding

Paper • 2507.23779 • Published Jul 31, 2025 • 44

Pixels, Patterns, but No Poetry: To See The World like Humans

Paper • 2507.16863 • Published Jul 21, 2025 • 68

upvoted 2 papers 9 months ago

DDT: Decoupled Diffusion Transformer

Paper • 2504.05741 • Published Apr 8, 2025 • 77

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7, 2025 • 122

Sirui Zhang

AI & ML interests

Recent Activity

Organizations

zsr200901's activity