Submitted by wenyi 154 GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning · 77 authors 480 3
Submitted by yuexiang96 41 Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning · 9 authors 20 2
Submitted by yilunzhao 35 SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks · 18 authors 28 2
Submitted by Haon-Chen 32 MoCa: Modality-aware Continual Pre-training Makes Better Bidirectional Multimodal Embeddings · 7 authors 36 1
Submitted by Lmxyy 31 Radial Attention: O(nlog n) Sparse Attention with Energy Decay for Long Video Generation · 14 authors 248 3
Submitted by Sansa 17 DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation · 7 authors 257 1
Submitted by RanjanSapkota 11 Thinking Beyond Tokens: From Brain-Inspired Intelligence to Cognitive Foundations for Artificial General Intelligence and its Societal Impact · 20 authors 4
Submitted by fushh7 10 HumanOmniV2: From Understanding to Omni-Modal Reasoning with Context · 10 authors 18 1
Submitted by Amar-S 9 Training for X-Ray Vision: Amodal Segmentation, Amodal Content Completion, and View-Invariant Object Representation from Multi-Camera Video · 5 authors 1
Submitted by puar-playground 8 MusiXQA: Advancing Visual Music Understanding in Multimodal Large Language Models · 9 authors 1
Submitted by Simase 5 FreeLong++: Training-Free Long Video Generation via Multi-band SpectralFusion · 2 authors 1
Submitted by AdinaY 5 IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering · 10 authors 28 1
Submitted by amanchadha 4 Peccavi: Visual Paraphrase Attack Safe and Distortion Free Image Watermarking Technique for AI-Generated Images · 7 authors 1
Submitted by huxueyu 3 Mixture of Reasonings: Teach Large Language Models to Reason with Adaptive Strategies · 4 authors 1
Submitted by Peter2023HuggingFace 1 FreNBRDF: A Frequency-Rectified Neural Material Representation · 3 authors 4 1
Submitted by AmirHossein-razlighi 1 Confident Splatting: Confidence-Based Compression of 3D Gaussian Splatting via Learnable Beta Distributions · 3 authors 1