ComfyUI-R1: Exploring Reasoning Models for Workflow Generation Paper • 2506.09790 • Published 15 days ago • 51
Seedance 1.0: Exploring the Boundaries of Video Generation Models Paper • 2506.09113 • Published 16 days ago • 90
Emerging Properties in Unified Multimodal Pretraining Paper • 2505.14683 • Published May 20 • 130 • 4
Unmasked Teacher: Towards Training-Efficient Video Foundation Models Paper • 2303.16058 • Published Mar 28, 2023
Harvest Video Foundation Models via Efficient Post-Pretraining Paper • 2310.19554 • Published Oct 30, 2023
MVBench: A Comprehensive Multi-modal Video Understanding Benchmark Paper • 2311.17005 • Published Nov 28, 2023 • 2
Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks Paper • 2401.14159 • Published Jan 25, 2024 • 3
UniFormer: Unified Transformer for Efficient Spatiotemporal Representation Learning Paper • 2201.04676 • Published Jan 12, 2022