Large Language Models Can Self-Improve in Long-context Reasoning Paper • 2411.08147 • Published 5 days ago • 47
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 8 items • Updated 13 days ago • 167
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss Paper • 2410.17243 • Published 26 days ago • 88
ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting Paper • 2410.17856 • Published 25 days ago • 49
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree Paper • 2410.16268 • Published 27 days ago • 65
FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors Paper • 2410.16271 • Published 27 days ago • 80
steiner-preview Collection Reasoning models trained on synthetic data using reinforcement learning. • 3 items • Updated 28 days ago • 23
Granite 3.0 Language Models Collection A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 13 days ago • 87
VidEgoThink: Assessing Egocentric Video Understanding Capabilities for Embodied AI Paper • 2410.11623 • Published Oct 15 • 46
HumanEval-V: Evaluating Visual Understanding and Reasoning Abilities of Large Multimodal Models Through Coding Tasks Paper • 2410.12381 • Published Oct 16 • 41
Movie Gen: A Cast of Media Foundation Models Paper • 2410.13720 • Published about 1 month ago • 88
MobA: A Two-Level Agent System for Efficient Mobile Task Automation Paper • 2410.13757 • Published about 1 month ago • 31
MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures Paper • 2410.13754 • Published about 1 month ago • 74
HelpSteer2-Preference: Complementing Ratings with Preferences Paper • 2410.01257 • Published Oct 2 • 19
Llama-3.1-Nemotron-70B Collection SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated Oct 15 • 136
Aria: An Open Multimodal Native Mixture-of-Experts Model Paper • 2410.05993 • Published Oct 8 • 107
WALL-E: World Alignment by Rule Learning Improves World Model-based LLM Agents Paper • 2410.07484 • Published Oct 9 • 48
DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data Paper • 2405.14333 • Published May 23 • 35