Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published Mar 20 • 51
Open-RS Collection Model weights & datasets in the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't" • 8 items • Updated Mar 21 • 12
Article Mixture of Experts Explained By osanseviero and 5 others • Dec 11, 2023 • 672
XiYanSQL Models Collection The XiYanSQL series are foundational SQL models available in various sizes, including 3B, 7B, 14B, and 32B. • 8 items • Updated 15 days ago • 7