
Troy Baker

jtroybaker

AI & ML interests

Predictive Maintenance, Reinforcement Learning, Natural Language Processing

Organizations

ZeroGPU Explorers, Hugging Face Discord Community

jtroybaker's activity

reacted to burtenshaw's post with 👍 about 17 hours ago
Qwen 3 fine-tuning >> MoE. Updated the experiment thread to include the config and script for fine-tuning the Qwen3-30B-A3B model.

The goal is to make a low-latency, non-thinking model for daily-driver coding, so 3 billion active parameters should be perfect.
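As a rough illustration of why a small active-parameter count helps latency, here is a back-of-envelope sketch. It assumes the nominal sizes suggested by the model name (about 30B total, about 3B active per token) and the common rule of thumb of roughly 2 FLOPs per used parameter per token in a forward pass. Both are approximations for illustration, not measurements from the experiment thread.

```python
# Nominal sizes read off the model name "Qwen3-30B-A3B" (assumed, not exact counts).
TOTAL_PARAMS = 30e9   # all experts combined
ACTIVE_PARAMS = 3e9   # parameters actually used for each token


def forward_flops_per_token(params: float) -> float:
    """Rule-of-thumb estimate: about 2 FLOPs per parameter used per token."""
    return 2.0 * params


# Per-token compute for the MoE (only active parameters count)
# versus a hypothetical dense model with the same total size.
moe_flops = forward_flops_per_token(ACTIVE_PARAMS)
dense_flops = forward_flops_per_token(TOTAL_PARAMS)
ratio = dense_flops / moe_flops

print(f"MoE (3B active): {moe_flops / 1e9:.0f} GFLOPs per token")
print(f"Dense 30B:       {dense_flops / 1e9:.0f} GFLOPs per token")
print(f"Compute ratio:   {ratio:.0f}x")
```

Under these assumptions the per-token compute looks like that of a 3B model, while memory still has to hold all of the experts, so the hardware footprint tracks the total size rather than the active size.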

✔️ training running
✔️ evals running
⏭️ improve dataset

The MoE isn't going to fit into Colab's A100, even with quantization (🙏 @UnslothAI), so I've been working on HF Spaces' H100s for this. Everything is available in the thread, and I'll share more tomorrow.

burtenshaw/Qwen3-Code-Lite#1