Seedance 1.0: Exploring the Boundaries of Video Generation Models Paper • 2506.09113 • Published 1 day ago • 37
Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning Paper • 2506.06205 • Published 6 days ago • 27
STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis Paper • 2506.06276 • Published 6 days ago • 18
Differentiable Solver Search for Fast Diffusion Sampling Paper • 2505.21114 • Published 16 days ago • 10
Emerging Properties in Unified Multimodal Pretraining Paper • 2505.14683 • Published 23 days ago • 130
Fast Text-to-Audio Generation with Adversarial Post-Training Paper • 2505.08175 • Published about 1 month ago • 22
DanceGRPO: Unleashing GRPO on Visual Generation Paper • 2505.07818 • Published about 1 month ago • 29
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published Feb 20 • 145
Step1X-Edit: A Practical Framework for General Image Editing Paper • 2504.17761 • Published Apr 24 • 88
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Paper • 2504.13837 • Published Apr 18 • 128