M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models Paper • 2504.10449 • Published 12 days ago • 10
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published 19 days ago • 172
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 Mar 26 • 120
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published Mar 20 • 48
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM Mar 12 • 400
Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch Paper • 2501.18512 • Published Jan 30 • 30
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 386
FAST: Efficient Action Tokenization for Vision-Language-Action Models Paper • 2501.09747 • Published Jan 16 • 24
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8 • 277
Monolith: Real Time Recommendation System With Collisionless Embedding Table Paper • 2209.07663 • Published Sep 16, 2022 • 1
Human-Timescale Adaptation in an Open-Ended Task Space Paper • 2301.07608 • Published Jan 18, 2023 • 1