Auto-Regressive vs Flow-Matching: a Comparative Study of Modeling Paradigms for Text-to-Music Generation Paper • 2506.08570 • Published 5 days ago • 27
Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation Paper • 2506.09350 • Published 4 days ago • 45
Seedance 1.0: Exploring the Boundaries of Video Generation Models Paper • 2506.09113 • Published 4 days ago • 62
Have we unified image generation and understanding yet? An empirical study of GPT-4o's image generation ability Paper • 2504.08003 • Published Apr 9 • 49
Scaling Up Dataset Distillation to ImageNet-1K with Constant Memory Paper • 2211.10586 • Published Nov 19, 2022 • 1
R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model Paper • 2503.05132 • Published Mar 7 • 58
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs Paper • 2406.16860 • Published Jun 24, 2024 • 61