view article Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • 1 day ago • 364
🧠 SmolLM3 Collection Smol, multilingual, long-context reasoner • 9 items • Updated about 6 hours ago • 39
ERNIE 4.5 Collection collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 23 items • Updated 6 days ago • 144
Comment on The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity Paper • 2506.09250 • Published 29 days ago • 28
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper • 2506.05209 • Published Jun 5 • 42
One-RL-to-See-Them-All Collection One RL to See Them All: Visual Triple Unified Reinforcement Learning. GitHub: https://github.com/MiniMax-AI/One-RL-to-See-Them-All • 5 items • Updated 29 days ago • 27
Distilling LLM Agent into Small Models with Retrieval and Code Tools Paper • 2505.17612 • Published May 23 • 79
Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents Paper • 2505.22954 • Published May 29 • 12
view article Article 🌙 Introducing **Moon**: Storytelling Generator Model By kulia-moon and 1 other • May 30 • 6
Exploring the Latent Capacity of LLMs for One-Step Text Generation Paper • 2505.21189 • Published May 27 • 62
Alchemist: Turning Public Text-to-Image Data into Generative Gold Paper • 2505.19297 • Published May 25 • 81
view article Article Bigger isn't always better: how to choose the most efficient model for context-specific tasks 🌱🧑🏼💻 By sasha • May 28 • 21