miniCTX Collection miniCTX: Neural Theorem Proving with (Long-)Contexts (ICLR 2025 Oral) • 8 items • Updated Mar 19 • 2
L1 Collection L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning • 7 items • Updated Jul 13 • 7
OpenScholar_V1 Collection The set of models, index, data associated with the paper "OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs". • 8 items • Updated Nov 22, 2024 • 40
SYNTHETIC-1 Collection A collection of tasks & verifiers for reasoning datasets • 9 items • Updated Jul 14 • 63
OpenCoder Collection OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 8 items • Updated Nov 23, 2024 • 85
TxGemma Release Collection Collection of open models to accelerate the development of therapeutics. • 5 items • Updated Jul 10 • 62
Hymba: A Hybrid-head Architecture for Small Language Models Paper • 2411.13676 • Published Nov 20, 2024 • 46
view article Article Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model By VictorSanh and 10 others • Aug 22, 2023 • 36
Tulu 3 Datasets Collection All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated Apr 30 • 88
view article Article Open-source DeepResearch – Freeing our search agents By m-ric and 4 others • Feb 4 • 1.28k