Submitted by roadjiang · 59 · Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model · 54 authors · 6
Submitted by YuuTennYi · 23 · GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation · 5 authors · 1
Submitted by BestWishYsh · 12 · MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft · 7 authors · 2
Submitted by ZhuangXialie · 6 · SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning · 6 authors · 1
Submitted by BestWishYsh · 6 · FlexIP: Dynamic Control of Preservation and Personality for Customized Image Generation · 4 authors · 1
Submitted by sauradip · 4 · In-2-4D: Inbetweening from Two Single-View Images to 4D Generation · 4 authors · 1
Submitted by stefan-it · 3 · ModernBERT or DeBERTaV3? Examining Architecture and Data Influence on Transformer Encoder Models Performance · 3 authors · 1
Submitted by aashiqmuhamed · 2 · SAEs Can Improve Unlearning: Dynamic Sparse Autoencoder Guardrails for Precision Unlearning in LLMs · 4 authors · 1
Submitted by DannyLan · 1 · Do PhD-level LLMs Truly Grasp Elementary Addition? Probing Rule Learning vs. Memorization in Large Language Models · 4 authors · 2
Submitted by AdinaY · - · Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs · 52 authors · 2
Submitted by saidwivedi · - · InteractVLM: 3D Interaction Reasoning from 2D Foundational Models · 7 authors · 1