Leaderboard
Display a leaderboard of course submissions
The smollest course on post training
🚀 New Course material is LIVE! | Follow the org to sign up |
Welcome to the comprehensive (and smollest) course to Fine-Tuning Language Models!
This free course will take you on a journey, from beginner to expert, in understanding, implementing, and optimizing fine-tuning techniques for large language models.
This first unit will help you onboard:
Let's get started!
This course is smol but fast! It's for software developers and engineers looking to fast track their LLM fine-tuning skills. If that's not you, check out the LLM Course.
In this course, you will:
At the end of this course, you'll understand how to fine-tune language models effectively and build specialized AI applications using the latest fine-tuning techniques.
The course is composed of:
This course is a living project, evolving with your feedback and contributions! Feel free to open issues and PRs in GitHub, and engage in discussions in our Discord server.
Here is the general syllabus for the course. A more detailed list of topics will be released with each unit.
# | Topic | Description | Released |
---|---|---|---|
1 | Instruction Tuning | Supervised fine-tuning, chat templates, instruction following | ✅ |
2 | Evaluation | Benchmarks and custom domain evaluation | September |
3 | Preference Alignment | Aligning models to human preferences with algorithms like DPO. | October |
4 | Reinforcement Learning | Optimizing models with based on reinforcement policies. | October |
5 | Vision Language Models | Adapt and use multimodal models | November |
6 | Synthetic Data | Generate synthetic datasets for custom domains | November |
7 | Award Ceremony | Showcase projects and celebrate | December |
To be able to follow this course, you should have:
If you don't have any of these, don't worry. Check out the LLM Course to get started.
The above courses are not prerequisites in themselves, so if you understand the concepts of LLMs and transformers, you can start the course now!
You only need 2 things:
You can choose to follow this course in audit mode, or do the activities and get one of the two certificates we'll issue. If you audit the course, you can participate in all the challenges and do assignments if you want, and you don't need to notify us.
The certification process is completely free:
Each chapter in this course is designed to be completed in 1 week, with approximately 3-4 hours of work per week.
Since there's a deadline, we provide you a recommended pace:
To get the most out of the course, we have some advice:
About the authors:
Ben is a Machine Learning Engineer at Hugging Face who focuses on building LLM applications, with post training and agentic approaches. Follow Ben on the Hub to see his latest projects.
We would like to extend our gratitude to the following individuals and partners for their invaluable contributions and support:
Contributions are welcome 🤗
Please ask your question in our discord server #fine-tuning-course-questions.
Now that you have all the information, let's get on board ⛵
Display a leaderboard of course submissions
Test your knowledge of VLMs
Test your knowledge of Preference Alignment
Test your knowledge of SFT in the real world.