🔄 In a Training Loop

Lewis Tunstall PRO

lewtun

huggingface

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

liked a Space about 15 hours ago

joelniklaus/harness-optimization

liked a model about 16 hours ago

thinkingmachines/Inkling

upvoted an article 3 days ago

Native-speed vLLM transformers modeling backend

Organizations

Buckets 59

lewtun/trl-internal-testing

lewtun/sft-static-9f352b-bucket

lewtun/sft-static-c286a8-bucket

lewtun/distillation-static-b3230e-bucket

lewtun/dpo-static-9dc680-bucket

lewtun/sft-static-c8c13e-bucket

Posts 8

Introducing OlympicCoder: a series of open reasoning models that can solve olympiad-level programming problems 🧑‍💻

- 7B open-r1/OlympicCoder-7B
- 32B open-r1/OlympicCoder-32B

We find that OlympicCoder models outperform Claude 3.7 Sonnet, as well as others over 100x larger 💪

Together with the models, we are releasing:

📊 CodeForces-CoTs: a new dataset of code problems from the most popular competitive coding platform, with R1 traces in C++ and Python: open-r1/codeforces-cots

🏆 IOI'2024: a new benchmark of VERY hard programming problems where even frontier models struggle to match human performance: open-r1/ioi

For links to the models and datasets, check out our latest progress report from Open R1: https://huggingface.co/blog/open-r1/update-3

Articles 41

Article

106

The Open Source Community is backing OpenEnv for Agentic RL

Collections 6

Papers 11

arxiv:2504.11354

arxiv:2504.05299

arxiv:2503.07572

arxiv:2502.02737

Spaces 121

Sft Static 9f352b

View and explore your data with an interactive dashboard

Sft Static C286a8

View and explore your data in an interactive dashboard

Distillation Static B3230e

Monitor your projects with an interactive dashboard

Dpo Static 9dc680

View your project's tracking dashboard in real time

Sft Static C8c13e

View your data insights on an interactive dashboard

Sft Static 034c12

View and explore your tracked data in an interactive dashboard

Models 324

lewtun/qwen3-0.6b-wordle-grpo

Text Generation • 0.6B • Updated 28 days ago • 39

lewtun/Qwen3-4B-Capybara-SFT

Text Generation • 4B • Updated 28 days ago • 105 • 1

lewtun/qwen3-4b-capybara

Text Generation • 4B • Updated 28 days ago • 117 • 1

lewtun/qwen3-0.6b-capybara-smoke

Text Generation • 0.6B • Updated Jun 8 • 8

lewtun/qwen3-0.6b-capybara

Text Generation • 0.6B • Updated Jun 7 • 5

lewtun/qwen3-0.6b-capybara-1step

Text Generation • 0.6B • Updated Jun 7 • 7

lewtun/qwen3-0.6b-angrygiraffe-sft

Text Generation • 0.6B • Updated Jun 3 • 16

lewtun/qwen3-4b-hermes-tooluse

Text Generation • 4B • Updated Jun 3 • 11

lewtun/qwen3-0.6b-sft-capybara

Text Generation • 0.6B • Updated May 12 • 8

lewtun/smollm2-1.7b-capybara-sft

Datasets 96

lewtun/ml-intern-sessions

Updated 21 days ago • 988 • 3

lewtun/capybara-25-20260507

Viewer • Updated May 7 • 25 • 4

lewtun/capybara-25-20260506

Viewer • Updated May 6 • 25 • 12

lewtun/capybara-25

Viewer • Updated May 6 • 25 • 17

lewtun/capybara-100-2026-05-05

Viewer • Updated May 5 • 100 • 17

lewtun/capybara-100-test-2026-05-05

Updated May 5 • 9

lewtun/openthoughts-100

Updated May 5 • 17

lewtun/Capybara-100

Viewer • Updated May 5 • 100 • 23

lewtun/running-dashboard-data

Viewer • Updated May 3 • 16 • 35

lewtun/dolci-think-sft-6400

Viewer • Updated Mar 11 • 6.4k • 17
