Article LeRobot Community Datasets: The “ImageNet” of Robotics — When and How? 1 day ago • 24
HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation Paper • 2504.21650 • Published 12 days ago • 13
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models Paper • 2505.04921 • Published 4 days ago • 115
OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning Paper • 2505.04601 • Published 4 days ago • 16
Multi-Agent System for Comprehensive Soccer Understanding Paper • 2505.03735 • Published 5 days ago • 18
Qwen3-Quantization Collection This is the official quantized models collection of Qwen3 Quantization • 42 items • Updated 5 days ago • 5
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities Paper • 2505.02567 • Published 7 days ago • 64