Lewis Tunstall PRO
lewtun
AI & ML interests
LLMs, LLMs, LLMs
Recent Activity
updated
a collection
about 9 hours ago
🧠 Reasoning datasets
updated
a dataset
about 15 hours ago
open-r1/codeforces-cots
updated
a dataset
about 15 hours ago
open-r1/codeforces-cots
Organizations
lewtun's activity
about <think> and </think>
2
#9 opened 5 days ago
by
volcanos

Fix typo in dataset load
#3 opened 10 days ago
by
lewtun

Please add HF Inference Endpoint and library tags which allow easier deployment
1
#8 opened 11 days ago
by
SolshineMisfit

Mode changed to Model
2
#7 opened 11 days ago
by
Solshine

Update README.md
1
#6 opened 12 days ago
by
nickname100231
Omitted <think> at the start and almost 10k tokens to debug 2 JS functions
3
#2 opened 15 days ago
by
operationdarkside
It seems to overthink
1
#3 opened 13 days ago
by
sm54
Upload dataset
#4 opened 12 days ago
by
lewtun

missing </think> in all subset
2
#3 opened 12 days ago
by
volcanos

Why is there a discrepancy between the 'Solutions' subset and the 'Solutions_py' subset?
1
#2 opened 16 days ago
by
waple

Trouble loading the dataset
2
#2 opened 16 days ago
by
lewtun

Update README.md
1
#1 opened 17 days ago
by
lhoestq

Size of the weights > 140 GB for a 32 GB model?
1
#2 opened 17 days ago
by
stelterlab

Remove fp32 weights
#4 opened 17 days ago
by
lewtun

Remove fp32 weights
#3 opened 17 days ago
by
lewtun

[Paper review] Small Models Struggle to Learn from Strong Reasoners
#19 opened about 1 month ago
by
lewtun

⚠️ Chat template foot gun with DeepSeek distilled models and RL format reward function
6
#17 opened about 1 month ago
by
lewtun

the finetune config of open-r1?
2
#6 opened about 1 month ago
by
MilyFang
Update README.md
3
#1 opened about 2 months ago
by
davidberenstein1957

[Experiment] Applying GRPO to DeepSeek-R1-Distill-Qwen-1.5B with LIMO
22
#15 opened about 2 months ago
by
lewtun
