Submitted by jt-zhang 49 SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training · 9 authors 2
Submitted by QPHutu 30 Optimizing Anytime Reasoning via Budget Relative Policy Optimization · 6 authors 2
Submitted by TianheWu 28 VisualQuality-R1: Reasoning-Induced Image Quality Assessment via Reinforcement Learning to Rank · 5 authors 3
Submitted by dariog 20 The Aloe Family Recipe for Open and Specialized Healthcare LLMs · 13 authors 2
Submitted by jiwonsong 13 Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning · 4 authors 2
Submitted by kaiyangzhou 12 Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning · 5 authors 2
Submitted by kaiyangzhou 11 Training-Free Watermarking for Autoregressive Image Generation · 4 authors 2
Submitted by wren93 11 VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation · 7 authors 2
Submitted by akhaliq 10 Hunyuan-Game: Industrial-grade Intelligent Game Creation Model · 50 authors 2
Submitted by SkAndMl 10 CS-Sum: A Benchmark for Code-Switching Dialogue Summarization and the Limits of Large Language Models · 4 authors 3
Submitted by Ningyu 9 Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training · 15 authors 2
Submitted by KID-22 9 NExT-Search: Rebuilding User Feedback Ecosystem for Generative AI Search · 7 authors 2
Submitted by kaiyangzhou 9 Fine-tuning Quantized Neural Networks with Zeroth-order Optimization · 5 authors 2
Submitted by Emperorizzis 8 Not All Correct Answers Are Equal: Why Your Distillation Source Matters · 8 authors 2
Submitted by huangsiteng 8 SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning · 8 authors 2
Submitted by bcywinski 7 Towards eliciting latent knowledge from LLMs with mechanistic interpretability · 4 authors 2
Submitted by tiantiaf 6 Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits · 12 authors 2
Submitted by iliashum 6 Lessons from Defending Gemini Against Indirect Prompt Injections · 14 authors 2
Submitted by safal312 6 Warm Up Before You Train: Unlocking General Reasoning in Resource-Constrained Settings · 5 authors 2
Submitted by sliuxl 6 MIGRATION-BENCH: Repository-Level Code Migration Benchmark from Java 8 · 11 authors 2
Submitted by DavidNguyen 5 CompeteSMoE -- Statistically Guaranteed Mixture of Experts Training via Competition · 6 authors 2
Submitted by Jianyuan1 4 Solve-Detect-Verify: Inference-Time Scaling with Flexible Generative Verifier · 6 authors 2
Submitted by KomeijiForce 3 Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection · 8 authors 2
Submitted by xianghe 3 Incorporating brain-inspired mechanisms for multimodal learning in artificial intelligence · 6 authors 2
Submitted by kellycyy 2 Will AI Tell Lies to Save Sick Children? Litmus-Testing AI Values Prioritization with AIRiskDilemmas · 7 authors 2
Submitted by Wyattz23 2 Tokenization Constraints in LLMs: A Study of Symbolic and Arithmetic Reasoning Limits · 5 authors 2
Submitted by charleslipku 2 CoIn: Counting the Invisible Reasoning Tokens in Commercial Opaque LLM APIs · 10 authors 2
Submitted by Jia-py 2 GeoRanker: Distance-Aware Ranking for Worldwide Image Geolocalization · 5 authors 2
Submitted by himel7 2 To Bias or Not to Bias: Detecting bias in News with bias-detector · 3 authors 2
Submitted by hwy9855 2 Masking in Multi-hop QA: An Analysis of How Language Models Perform with Context Permutation · 4 authors 2
Submitted by hmarkc 2 Rethinking Optimal Verification Granularity for Compute-Efficient Test-Time Scaling · 6 authors 2
Submitted by mohbattharani 1 KERL: Knowledge-Enhanced Personalized Recipe Recommendation using Large Language Models · 2 authors 2
Submitted by Marl 1 Dynadiff: Single-stage Decoding of Images from Continuously Evolving fMRI · 3 authors 2
Submitted by Veeru 1 Understanding Gen Alpha Digital Language: Evaluation of LLM Safety Systems for Content Moderation · 2 authors 2
Submitted by jwgcurrie - Towards Embodied Cognition in Robots via Spatially Grounded Synthetic Worlds · 7 authors 2
Submitted by Beegbrain - Object-Centric Representations Improve Policy Generalization in Robot Manipulation · 4 authors 2
Submitted by florin-hf - The Distracting Effect: Understanding Irrelevant Passages in RAG · 4 authors 2