view article Article Fine-Tuning 1B LLaMA 3.2: A Comprehensive Step-by-Step Guide with Code By ImranzamanML • Oct 2, 2024 • 68
MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder Paper • 2505.07916 • Published 16 days ago • 119
SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning Paper • 2504.08600 • Published Apr 11 • 29
view article Article An Introduction to AI Model Optimization Techniques By PrunaAI and 1 other • Apr 18 • 28
view post Post 4786 We distill a more accurate and concise dataset from DeepSeek R1, and also provide a distillation pipeline code repository.🤗Dataset: SmallDoge/SmallThoughtsCode: https://github.com/SmallDoges/small-thoughts See translation 🚀 10 10 ❤️ 4 4 + Reply