Submitted by tytyt 40 OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion · 10 authors 1
Submitted by happzy2633 34 CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization · 19 authors 16 1
Submitted by taiwang 33 StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling · 12 authors 76 2
Submitted by judge 27 RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents · 16 authors 2
Submitted by wangrongsheng 22 MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos · 7 authors 16 1
Submitted by yxlu0 17 Is Diversity All You Need for Scalable Robotic Manipulation? · 10 authors 2.17k 1
Submitted by guokan-shang 15 Nile-Chat: Egyptian Language Models for Arabic and Latin Scripts · 10 authors 1
Submitted by zsytony 14 Coding Triangle: How Does Large Language Model Understand Code? · 6 authors 1
Submitted by songtingyu 11 Efficiency-Effectiveness Reranking FLOPs for LLM-based Rerankers · 5 authors 1
Submitted by acharkq 10 PRING: Rethinking Protein-Protein Interaction Prediction from Pairs to Graphs · 12 authors 1
Submitted by songdj 10 SAMed-2: Selective Memory Enhanced Medical Segment Anything Model · 14 authors 1
Submitted by BestWishYsh 8 Tora2: Motion and Appearance Customized Diffusion Transformer for Multi-Entity Video Generation · 5 authors 1
Submitted by ZetangForward 8 LOOM-Scope: a comprehensive and efficient LOng-cOntext Model evaluation framework · 8 authors 11 1
Submitted by xinyu1205 6 High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning · 6 authors 1
Submitted by Xuandong 4 The Landscape of Memorization in LLMs: Mechanisms, Measurement, and Mitigation · 4 authors 1
Submitted by ChristophReich1996 3 Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion · 7 authors 10 2
Submitted by nielsr - AXLearn: Modular Large Model Training on Heterogeneous Infrastructure · 37 authors 2.12k 1