Seedance 1.0: Exploring the Boundaries of Video Generation Models Paper • 2506.09113 • Published 23 days ago • 91
PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models Paper • 2506.16054 • Published 14 days ago • 57
Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning Paper • 2506.13654 • Published 17 days ago • 42
Marrying Autoregressive Transformer and Diffusion with Multi-Reference Autoregression Paper • 2506.09482 • Published 22 days ago • 46
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention Paper • 2506.13585 • Published 17 days ago • 249
SimVPv2: Towards Simple yet Powerful Spatiotemporal Predictive Learning Paper • 2211.12509 • Published Nov 22, 2022
Taming LLMs by Scaling Learning Rates with Gradient Grouping Paper • 2506.01049 • Published Jun 1 • 36
Taming LLMs by Scaling Learning Rates with Gradient Grouping Paper • 2506.01049 • Published Jun 1 • 36 • 4
Taming LLMs by Scaling Learning Rates with Gradient Grouping Paper • 2506.01049 • Published Jun 1 • 36 • 4
Taming LLMs by Scaling Learning Rates with Gradient Grouping Paper • 2506.01049 • Published Jun 1 • 36
XAttention: Block Sparse Attention with Antidiagonal Scoring Paper • 2503.16428 • Published Mar 20 • 14
From Words to Structured Visuals: A Benchmark and Framework for Text-to-Diagram Generation and Editing Paper • 2411.11916 • Published Nov 18, 2024 • 3
Peer Review as A Multi-Turn and Long-Context Dialogue with Role-Based Interactions Paper • 2406.05688 • Published Jun 9, 2024 • 1
OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning Paper • 2306.11249 • Published Jun 20, 2023 • 2
OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning Paper • 2306.11249 • Published Jun 20, 2023 • 2