Siyuan Li

Lupin1998

https://lupin1998.github.io/

AI & ML interests

Network Design, Self-supervised Learning, Computer Vision, Data-centric ML, AI for Science

Recent Activity

upvoted a paper about 1 month ago

Seedance 1.0: Exploring the Boundaries of Video Generation Models

upvoted a paper about 1 month ago

PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models

upvoted a paper about 1 month ago

Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning

Organizations

upvoted 5 papers about 1 month ago

Seedance 1.0: Exploring the Boundaries of Video Generation Models

Paper • 2506.09113 • Published Jun 10 • 98

PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models

Paper • 2506.16054 • Published Jun 19 • 60

Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning

Paper • 2506.13654 • Published Jun 16 • 43

Marrying Autoregressive Transformer and Diffusion with Multi-Reference Autoregression

Paper • 2506.09482 • Published Jun 11 • 46

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 259

upvoted 2 papers about 2 months ago

Adversarial AutoMixup

Paper • 2312.11954 • Published Dec 19, 2023 • 4

Taming LLMs by Scaling Learning Rates with Gradient Grouping

Paper • 2506.01049 • Published Jun 1 • 37

upvoted a paper 2 months ago

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published May 17 • 120

upvoted a paper 3 months ago

XAttention: Block Sparse Attention with Antidiagonal Scoring

Paper • 2503.16428 • Published Mar 20 • 14

upvoted 11 papers 4 months ago

From Words to Structured Visuals: A Benchmark and Framework for Text-to-Diagram Generation and Editing

Paper • 2411.11916 • Published Nov 18, 2024 • 3

Peer Review as A Multi-Turn and Long-Context Dialogue with Role-Based Interactions

Paper • 2406.05688 • Published Jun 9, 2024 • 1

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published Sep 18, 2024 • 78

Multi-Token Attention

Paper • 2504.00927 • Published Apr 1 • 54

MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization

Paper • 2504.00999 • Published Apr 1 • 93
