Submitted by lx865712528 42 Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models · 34 authors 2
Submitted by BestWishYsh 33 Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step · 7 authors 2
Submitted by KairuiHu 22 Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos · 8 authors 2
Submitted by s-emanuilov 21 Temporal Preference Optimization for Long-Form Video Understanding · 5 authors 3
Submitted by yentinglin 14 Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback · 14 authors 3
Submitted by s-emanuilov 13 IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models · 15 authors 2
Submitted by shuzyuan 9 Hallucinations Can Improve Large Language Models in Drug Discovery · 2 authors 8
Submitted by byliutao 9 One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt · 9 authors 2
Submitted by BestWishYsh 7 EchoVideo: Identity-Preserving Human Video Generation by Multimodal Feature Fusion · 6 authors 2
Submitted by hawei 6 Control LLM: Controlled Evolution for Intelligence Retention in LLM · 7 authors 2