Lewis Tunstall
PRO
lewtun
1174 followers · 83 following
https://lewtun.github.io/blog/
_lewtun
lewtun
AI & ML interests
LLMs, LLMs, LLMs
Recent Activity
published a dataset about 14 hours ago: HuggingFaceH4/lima
upvoted an article 1 day ago: Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers
liked a Space 2 days ago: ysharma/mood-font-walkthrough
lewtun's models (288), sorted by recently updated
lewtun/Qwen3-32B-SFT-20250908120312 • Updated 5 days ago
lewtun/Qwen3-0.6B-SFT-20250908114642 • Text Generation • 0.6B • Updated 5 days ago • 7
lewtun/Qwen3-32B-SFT-20250908115917 • Updated 5 days ago
lewtun/SmolLM2-135M-Instruct-SFT-Trackio-Test • Text Generation • 0.1B • Updated Aug 7 • 18
lewtun/Qwen3-0.6B-SFT-Trackio-Test • Text Generation • 0.6B • Updated Aug 7 • 16
lewtun/Qwen3-0.6B-SFT-Demo • Text Generation • 0.6B • Updated Aug 7 • 11
lewtun/zephyr-7b-gemma-dpo • Updated Jul 24
lewtun/zephyr-7b-gemma-sft • Updated Jul 24
lewtun/smollm2-360M-sft • Updated Jul 24
lewtun/smollm2-1.7B-sft • Updated Jul 24
lewtun/smollm-360M-instruct-new • Updated Jul 24
lewtun/mistral-7b-sft-constitutional-ai • Updated Jul 24
lewtun/mistral-7b-dpo-constitutional-ai • Updated Jul 24
lewtun/zephyr-7b-sft-full • Text Generation • 0.0B • Updated Jul 24 • 5
lewtun/Qwen2.5-1.5B-Open-R1-Distill • Text Generation • 2B • Updated Apr 16 • 112
lewtun/does-deepspeed-still-work-sft • Text Generation • 2B • Updated Apr 16 • 8
lewtun/Llama-3.2-1B-SFT-Capybara-No-Packing-Llama • Text Generation • 1B • Updated Apr 16 • 7
lewtun/Qwen2.5-1.5B-SFT-Capybara-No-Packing • Text Generation • 2B • Updated Apr 15 • 7
lewtun/Llama-3.2-1B-SFT-Capybara-No-Packing-ChatML • Text Generation • 1B • Updated Apr 15 • 7
lewtun/Qwen2.5-7B-Instruct-GRPO • Updated Mar 21
lewtun/Qwen2.5-Math-1.5B-Instruct-GRPO • Updated Mar 6
lewtun/dummy-config-test • Text Generation • Updated Feb 20 • 10
lewtun/Qwen2.5-1.5B-Open-R1-Code-GRPO • Updated Feb 18
lewtun/smollm2-distill-default-chat-template • Text Generation • 2B • Updated Feb 17 • 7
lewtun/qwen2.5-1.5b-distill-default-chat-template • 2B • Updated Feb 17 • 8
lewtun/DeepSeek-R1-Distill-Qwen-1.5B-GRPO • 2B • Updated Feb 7 • 7
lewtun/Qwen-2.5-7B-Simple-RL • Updated Feb 7
lewtun/DeepSeek-R1-Distill-Qwen-7B-GRPO • Updated Feb 1
lewtun/Qwen2.5-1.5B-Open-R1-GRPO • Updated Jan 31
lewtun/Qwen2-0.5B-SFT • Updated Oct 17, 2024