Lewis Tunstall
PRO
lewtun
1209 followers · 85 following
https://lewtun.github.io/blog/
_lewtun
lewtun
AI & ML interests
LLMs, LLMs, LLMs
Recent Activity
upvoted a paper 40 minutes ago
Bridging Offline and Online Reinforcement Learning for LLMs
upvoted a paper 43 minutes ago
CWM: An Open-Weights LLM for Research on Code Generation with World Models
upvoted a collection about 14 hours ago
Environment Hub
lewtun's models 288
lewtun/Qwen3-32B-SFT-20250908120312 • Updated Sep 8
lewtun/Qwen3-0.6B-SFT-20250908114642 • Text Generation • 0.6B • Updated Sep 8 • 13
lewtun/Qwen3-32B-SFT-20250908115917 • Updated Sep 8
lewtun/SmolLM2-135M-Instruct-SFT-Trackio-Test • Text Generation • 0.1B • Updated Aug 7 • 9
lewtun/Qwen3-0.6B-SFT-Trackio-Test • Text Generation • 0.6B • Updated Aug 7 • 21
lewtun/Qwen3-0.6B-SFT-Demo • Text Generation • 0.6B • Updated Aug 7 • 24
lewtun/zephyr-7b-gemma-dpo • Updated Jul 24
lewtun/zephyr-7b-gemma-sft • Updated Jul 24
lewtun/smollm2-360M-sft • Updated Jul 24
lewtun/smollm2-1.7B-sft • Updated Jul 24
lewtun/smollm-360M-instruct-new • Updated Jul 24
lewtun/mistral-7b-sft-constitutional-ai • Updated Jul 24
lewtun/mistral-7b-dpo-constitutional-ai • Updated Jul 24
lewtun/zephyr-7b-sft-full • Text Generation • 266k • Updated Jul 24 • 1
lewtun/Qwen2.5-1.5B-Open-R1-Distill • Text Generation • 2B • Updated Apr 16 • 1
lewtun/does-deepspeed-still-work-sft • Text Generation • 2B • Updated Apr 16
lewtun/Llama-3.2-1B-SFT-Capybara-No-Packing-Llama • Text Generation • 1B • Updated Apr 16
lewtun/Qwen2.5-1.5B-SFT-Capybara-No-Packing • Text Generation • 2B • Updated Apr 15 • 2
lewtun/Llama-3.2-1B-SFT-Capybara-No-Packing-ChatML • Text Generation • 1B • Updated Apr 15 • 2
lewtun/Qwen2.5-7B-Instruct-GRPO • Updated Mar 21
lewtun/Qwen2.5-Math-1.5B-Instruct-GRPO • Updated Mar 6
lewtun/dummy-config-test • Text Generation • Updated Feb 20
lewtun/Qwen2.5-1.5B-Open-R1-Code-GRPO • Updated Feb 18
lewtun/smollm2-distill-default-chat-template • Text Generation • 2B • Updated Feb 17
lewtun/qwen2.5-1.5b-distill-default-chat-template • 2B • Updated Feb 17 • 1
lewtun/DeepSeek-R1-Distill-Qwen-1.5B-GRPO • 2B • Updated Feb 7
lewtun/Qwen-2.5-7B-Simple-RL • Updated Feb 7
lewtun/DeepSeek-R1-Distill-Qwen-7B-GRPO • Updated Feb 1
lewtun/Qwen2.5-1.5B-Open-R1-GRPO • Updated Jan 31
lewtun/Qwen2-0.5B-SFT • Updated Oct 17, 2024