Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
205527.2
TFLOPS
1203
152
678
Lewis Tunstall
PRO
lewtun
Follow
ArchiTop's profile picture
2stacks's profile picture
trgardos's profile picture
1140 followers
·
83 following
https://lewtun.github.io/blog/
_lewtun
lewtun
AI & ML interests
LLMs, LLMs, LLMs
Recent Activity
liked
a Space
1 day ago
timqian/like-history
liked
a dataset
2 days ago
jxm/gpt-oss20b-samples
new
activity
2 days ago
HuggingFaceH4/Multilingual-Thinking:
Update README.md
View all activity
Organizations
lewtun
's models
285
Sort: Recently updated
lewtun/SmolLM2-135M-Instruct-SFT-Trackio-Test
Text Generation
•
0.1B
•
Updated
5 days ago
•
11
lewtun/Qwen3-0.6B-SFT-Trackio-Test
Text Generation
•
0.6B
•
Updated
5 days ago
•
5
lewtun/Qwen3-0.6B-SFT-Demo
Text Generation
•
0.6B
•
Updated
5 days ago
•
6
lewtun/zephyr-7b-gemma-dpo
Updated
19 days ago
lewtun/zephyr-7b-gemma-sft
Updated
19 days ago
lewtun/smollm2-360M-sft
Updated
19 days ago
lewtun/smollm2-1.7B-sft
Updated
19 days ago
lewtun/smollm-360M-instruct-new
Updated
19 days ago
lewtun/mistral-7b-sft-constitutional-ai
Updated
19 days ago
lewtun/mistral-7b-dpo-constitutional-ai
Updated
19 days ago
lewtun/zephyr-7b-sft-full
Text Generation
•
0.0B
•
Updated
19 days ago
•
8
lewtun/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
2B
•
Updated
Apr 16
•
6
lewtun/does-deepspeed-still-work-sft
Text Generation
•
2B
•
Updated
Apr 16
•
6
lewtun/Llama-3.2-1B-SFT-Capybara-No-Packing-Llama
Text Generation
•
1B
•
Updated
Apr 16
•
7
lewtun/Qwen2.5-1.5B-SFT-Capybara-No-Packing
Text Generation
•
2B
•
Updated
Apr 15
•
3
lewtun/Llama-3.2-1B-SFT-Capybara-No-Packing-ChatML
Text Generation
•
1B
•
Updated
Apr 15
•
4
lewtun/Qwen2.5-7B-Instruct-GRPO
Updated
Mar 21
lewtun/Qwen2.5-Math-1.5B-Instruct-GRPO
Updated
Mar 6
lewtun/dummy-config-test
Text Generation
•
Updated
Feb 20
•
2
lewtun/Qwen2.5-1.5B-Open-R1-Code-GRPO
Updated
Feb 18
lewtun/smollm2-distill-default-chat-template
Text Generation
•
2B
•
Updated
Feb 17
•
2
lewtun/qwen2.5-1.5b-distill-default-chat-template
2B
•
Updated
Feb 17
•
3
lewtun/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
2B
•
Updated
Feb 7
•
2
lewtun/Qwen-2.5-7B-Simple-RL
Updated
Feb 7
lewtun/DeepSeek-R1-Distill-Qwen-7B-GRPO
Updated
Feb 1
lewtun/Qwen2.5-1.5B-Open-R1-GRPO
Updated
Jan 31
lewtun/Qwen2-0.5B-SFT
Updated
Oct 17, 2024
lewtun/Qwen2.5-0.5B-SFT-LoRA
Updated
Sep 30, 2024
lewtun/Llama-3.1-8B-SFT-LoRA-packing-no-lm-head
Updated
Sep 30, 2024
lewtun/Llama-3.1-8B-SFT-LoRA-no-packing
Updated
Sep 30, 2024
Previous
1
2
3
...
10
Next