Submitted by zichenwen 54 The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs · 14 authors 45 2
Submitted by korallll 46 A Data-Centric Framework for Addressing Phonetic and Prosodic Challenges in Russian Speech Generative Models · 7 authors 9 2
Submitted by yukimasano 19 Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning · 8 authors 117 3
Submitted by nqbinh 18 CSD-VAR: Content-Style Decomposition in Visual Autoregressive Models · 5 authors 4
Submitted by Holarissun 13 Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities · 2 authors 1
Submitted by wzk1015 12 Mono-InternVL-1.5: Towards Cheaper and Faster Monolithic Multimodal Large Language Models · 12 authors 61 1
Submitted by shikhar7ssu 5 OpenBEATs: A Fully Open-Source General-Purpose Audio Encoder · 7 authors 1
Submitted by psp-dada 5 Mitigating Object Hallucinations via Sentence-Level Early Intervention · 4 authors 5 1
Submitted by Hiiamein 5 RedOne: Revealing Domain-specific LLM Post-Training in Social Networking Services · 25 authors 2
Submitted by gonzmart 4 The Generative Energy Arena (GEA): Incorporating Energy Awareness in Large Language Model (LLM) Human Evaluations · 5 authors 1
Submitted by 0xnu 3 Quantitative Risk Management in Volatile Markets with an Expectile-Based Framework for the FTSE Index · 1 authors 0 1