MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft • Paper • 2504.08388 • Published 5 days ago • 37
GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation • Paper • 2504.08736 • Published 5 days ago • 39
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model • Paper • 2504.08685 • Published 5 days ago • 104
Clinical ModernBERT: An efficient and long context encoder for biomedical text • Paper • 2504.03964 • Published 11 days ago • 5
Why Reasoning Matters? A Survey of Advancements in Multimodal Reasoning (v1) • Paper • 2504.03151 • Published 12 days ago • 12
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought • Paper • 2504.05599 • Published 8 days ago • 77
OmniSVG: A Unified Scalable Vector Graphics Generation Model • Paper • 2504.06263 • Published 8 days ago • 141
T1: Tool-integrated Self-verification for Test-time Compute Scaling in Small Language Models • Paper • 2504.04718 • Published 9 days ago • 38
SmolVLM: Redefining small and efficient multimodal models • Paper • 2504.05299 • Published 9 days ago • 158
Open Deep Search: Democratizing Search with Open-source Reasoning Agents • Paper • 2503.20201 • Published 21 days ago • 44
ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation • Paper • 2503.21729 • Published 20 days ago • 27