Lewis Tunstall's picture

Lewis Tunstall PRO

lewtun

·

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

published a model about 11 hours ago

lewtun/Qwen3-32B-SFT-20250908120312

updated a model about 11 hours ago

lewtun/Qwen3-0.6B-SFT-20250908114642

published a model about 12 hours ago

lewtun/Qwen3-32B-SFT-20250908115917

View all activity

Organizations

upvoted a paper 3 days ago

Open Data Synthesis For Deep Research

Paper • 2509.00375 • Published 10 days ago • 58

upvoted an article 6 days ago

Article

Make your ZeroGPU Spaces go brrr with PyTorch ahead-of-time compilation

By

and 3 others •

7 days ago

• 44

upvoted a paper 12 days ago

On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes

Paper • 2306.13649 • Published Jun 23, 2023 • 25

upvoted a paper 13 days ago

Deep Think with Confidence

Paper • 2508.15260 • Published 19 days ago • 81

upvoted 2 papers 17 days ago

LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries

Paper • 2508.15760 • Published 18 days ago • 44

MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers

Paper • 2508.14704 • Published 19 days ago • 42

upvoted an article 19 days ago

Article

Old Maps, New Terrain: Updating Labour Taxonomies for the AI Era

By

and 1 other •

20 days ago

• 15

upvoted a paper 20 days ago

τ^2-Bench: Evaluating Conversational Agents in a Dual-Control Environment

Paper • 2506.07982 • Published Jun 9 • 6

upvoted an article 21 days ago

Article

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

By

and 1 other •

22 days ago

• 54

upvoted an article 26 days ago

Article

Announcing the Synthetic Online Conversations Dataset (SOC)

By

•

27 days ago

• 11

upvoted a paper 27 days ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8 • 175

upvoted 3 articles about 1 month ago

Article

The GPT-OSS models are here… and they’re energy-efficient!

By

•

Aug 7

• 19

Article

Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training

By

and 4 others •

Aug 8

• 60

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

By

and 3 others •

Dec 9, 2022

• 337

upvoted a collection about 1 month ago

IFBench

Datasets for IFBench benchmark and paper! • 3 items • Updated Jul 3 • 5

upvoted an article about 1 month ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

By

and 11 others •

Aug 5

• 492

upvoted a collection about 1 month ago

gpt-oss

Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7 • 338

upvoted 3 articles about 1 month ago

Article

Mixture of Experts Explained

By

and 5 others •

Dec 11, 2023

• 883

Article

Introducing Command A Vision: Multimodal AI built for Business

By

and 3 others •

Jul 31

• 63

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

By

and 4 others •

Jul 29

• 170