GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents Paper β’ 2604.26752 β’ Published 13 days ago β’ 104
UniDoc-RL: Coarse-to-Fine Visual RAG with Hierarchical Actions and Dense Rewards Paper β’ 2604.14967 β’ Published 26 days ago β’ 15