view article Article LeRobot Community Datasets: The “ImageNet” of Robotics — When and How? 1 day ago • 24
HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation Paper • 2504.21650 • Published 12 days ago • 13
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models Paper • 2505.04921 • Published 4 days ago • 115
OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning Paper • 2505.04601 • Published 4 days ago • 16
Multi-Agent System for Comprehensive Soccer Understanding Paper • 2505.03735 • Published 5 days ago • 18
Qwen3-Quantization Collection This is the official quantized models collection of Qwen3 Quantization • 42 items • Updated 5 days ago • 5
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities Paper • 2505.02567 • Published 7 days ago • 64
FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models Paper • 2505.02735 • Published 7 days ago • 27
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning Paper • 2505.03318 • Published 6 days ago • 85
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers Paper • 2504.20752 • Published 13 days ago • 81
Improving Editability in Image Generation with Layer-wise Memory Paper • 2505.01079 • Published 10 days ago • 27
T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT Paper • 2505.00703 • Published 10 days ago • 39
UniversalRAG: Retrieval-Augmented Generation over Multiple Corpora with Diverse Modalities and Granularities Paper • 2504.20734 • Published 13 days ago • 61
Reinforcement Learning for Reasoning in Large Language Models with One Training Example Paper • 2504.20571 • Published 13 days ago • 90