🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 4 items • Updated about 21 hours ago • 7
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 8 days ago • 263
Scaling Test-Time Compute with Open Models Collection Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated 24 days ago • 22
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated 22 days ago • 549
Llama 3.3 Collection This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 123
PixMo Collection A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated 24 days ago • 55
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated 24 days ago • 292
Tulu 3 Datasets Collection All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated 1 day ago • 64
OLMoE Collection Artifacts for open mixture-of-experts language models. • 13 items • Updated 24 days ago • 29