Submitted by YuSun-AI 63 ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning · 10 authors 2
Submitted by itaowe 38 SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks · 9 authors 2
Submitted by awojustin 28 VRBench: A Benchmark for Multi-Step Reasoning in Long Narrative Videos · 17 authors 2
Submitted by YunxinLi 27 AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation · 6 authors 4
Submitted by Howe77 18 Domain2Vec: Vectorizing Datasets to Find the Optimal Data Mixture without Training · 4 authors 2
Submitted by dawn0815 17 Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts · 7 authors 2
Submitted by Owen777 15 PosterCraft: Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework · 14 authors 3
Submitted by Ningyu 11 AutoMind: Adaptive Knowledgeable Agent for Automated Data Science · 9 authors 2
Submitted by avery00 11 VideoDeepResearch: Long Video Understanding With Agentic Tool Using · 5 authors 2
Submitted by BiaoGong 11 Ming-Omni: A Unified Multimodal Model for Perception and Generation · 58 authors 2
Submitted by zbrl 10 CreatiPoster: Towards Editable and Controllable Multi-Layer Graphic Design Generation · 9 authors 2
Submitted by Ningyu 9 ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark · 10 authors 2
Submitted by billpsomas 6 Attention, Please! Revisiting Attentive Probing for Masked Image Modeling · 9 authors 2
Submitted by Wesleythu 5 VerIF: Verification Engineering for Reinforcement Learning in Instruction Following · 6 authors 2
Submitted by reach-vb 5 Comment on The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity · 2 authors 2
Submitted by wanglz14 4 DreamActor-H1: High-Fidelity Human-Product Demonstration Video Generation via Motion-designed Diffusion Transformers · 7 authors 2
Submitted by LavenderLA 4 UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting · 4 authors 3
Submitted by Speeeed 4 Compound AI Systems Optimization: A Survey of Methods, Challenges, and Future Directions · 6 authors 2
Submitted by codelion 4 Eliciting Fine-Tuned Transformer Capabilities via Inference-Time Techniques · 1 authors 2
Submitted by vincolle 3 TeleMath: A Benchmark for Large Language Models in Telecom Mathematical Problem Solving · 6 authors 2
Submitted by sayakpaul 2 Fine-Grained Perturbation Guidance via Attention Head Selection · 10 authors 2
Submitted by ordavids1 2 Decomposing MLP Activations into Interpretable Features via Semi-Nonnegative Matrix Factorization · 3 authors 2
Submitted by benfielding 2 NoLoCo: No-all-reduce Low Communication Training Method for Large Models · 5 authors 2
Submitted by pkargupta 2 TaxoAdapt: Aligning LLM-Based Multidimensional Taxonomy Construction to Evolving Research Corpora · 6 authors 2
Submitted by pkargupta 2 Beyond True or False: Retrieval-Augmented Hierarchical Analysis of Nuanced Claims · 3 authors 2
Submitted by JJ-TMT 2 Breaking Data Silos: Towards Open and Scalable Mobility Foundation Models via Generative Continual Learning · 5 authors 2
Submitted by hlzhang109 1 Discovering Hierarchical Latent Capabilities of Language Models via Causal Representation Learning · 4 authors 2
Submitted by Franck-Dernoncourt 1 LaMP-Cap: Personalized Figure Caption Generation With Multimodal Figure Profiles · 11 authors 2
Submitted by yiren98 1 MCA-Bench: A Multimodal Benchmark for Evaluating CAPTCHA Robustness Against VLM-based Attacks · 4 authors 2
Submitted by xinjjj - EmbodiedGen: Towards a Generative 3D World Engine for Embodied Intelligence · 8 authors 2
Submitted by Nickwzk - StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams · 5 authors 2