SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published 8 days ago • 158
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published 27 days ago • 46
SOAP: Improving and Stabilizing Shampoo using Adam Paper • 2409.11321 • Published Sep 17, 2024 • 1
Small Models Struggle to Learn from Strong Reasoners Paper • 2502.12143 • Published Feb 17 • 34
Granite Data Collection This collection has a set of artifacts which are related to curating and evaluating datasets used for Granite models • 16 items • Updated Feb 28 • 4
view article Article Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥 Feb 18 • 96