Open-RS Collection Model weights & datasets in the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t" • 8 items • Updated 4 days ago • 10
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published 4 days ago • 38
Gemma 3 Collection All versions of Google's new multimodal models in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 29 items • Updated 9 days ago • 43
Article LeRobot goes to driving school: World’s largest open-source self-driving dataset 14 days ago • 67
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 13 days ago • 342