Zhenxiong Tan PRO

Yuanshi

AI & ML interests

Reinforcement Learning; Large Language Models; Multimodality; AI Infrastructure

Recent Activity

upvoted a paper 2 days ago

Q-ARVD: Quantizing Autoregressive Video Diffusion Models

Paper • 2605.21072 • Published 4 days ago • 19

upvoted a paper 3 days ago

Mix-Quant: Quantized Prefilling, Precise Decoding for Agentic LLMs

Paper • 2605.20315 • Published 5 days ago • 28

upvoted a paper 6 days ago

ReactiveGWM: Steering NPC in Reactive Game World Models

Paper • 2605.15256 • Published 10 days ago • 28

upvoted a paper 23 days ago

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

Paper • 2604.28185 • Published 24 days ago • 90

upvoted a paper 26 days ago

Vision-Language-Action Safety: Threats, Challenges, Evaluations, and Mechanisms

Paper • 2604.23775 • Published 28 days ago • 45

upvoted a paper about 1 month ago

DMax: Aggressive Parallel Decoding for dLLMs

Paper • 2604.08302 • Published Apr 9 • 52

upvoted 2 papers about 2 months ago

Gated Condition Injection without Multimodal Attention: Towards Controllable Linear-Attention Transformers

Paper • 2603.27666 • Published Mar 29 • 18

Make Geometry Matter for Spatial Reasoning

Paper • 2603.26639 • Published Mar 27 • 32

upvoted 3 papers 2 months ago

MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

Paper • 2603.15726 • Published Mar 16 • 186

Anatomy of a Lie: A Multi-Stage Diagnostic Framework for Tracing Hallucinations in Vision-Language Models

Paper • 2603.15557 • Published Mar 16 • 29

ViFeEdit: A Video-Free Tuner of Your Video Diffusion Transformer

Paper • 2603.15478 • Published Mar 16 • 24

upvoted a paper 3 months ago

dVoting: Fast Voting for dLLMs

Paper • 2602.12153 • Published Feb 12 • 22

upvoted a paper 4 months ago

SliderEdit: Continuous Image Editing with Fine-Grained Instruction Control

Paper • 2511.09715 • Published Nov 12, 2025 • 11

upvoted 3 papers 5 months ago

upvoted 4 papers 6 months ago

Vision Bridge Transformer at Scale

Paper • 2511.23199 • Published Nov 28, 2025 • 47

Computer-Use Agents as Judges for Generative User Interface

Paper • 2511.15567 • Published Nov 19, 2025 • 54

In-Video Instructions: Visual Signals as Generative Control

Paper • 2511.19401 • Published Nov 24, 2025 • 32

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published Nov 14, 2025 • 195
