Loïck BOURDOIS

lbourdois

10541 48 19

https://lbourdois.github.io/blog/

AI & ML interests

👀

Recent Activity

updated a model 1 day ago

alphaedge-ai/siglip2-so400m-patch16-512-zho-16384

updated a model 1 day ago

alphaedge-ai/siglip2-so400m-patch16-512-ydd-16384

updated a model 1 day ago

alphaedge-ai/siglip2-so400m-patch16-512-vie-16384

View all activity

Organizations

Posts 6

Post

1089

New blog post!
An introduction to a little-known but highly effective model reduction method: 𝗧𝗿𝗶𝗺𝗺𝗶𝗻𝗴✂️
We show how to reduce model size (we went up to 87.24% reduction) while preserving its performance.

We applied this technique to 16 different model families across several modalities to illustrate that it works on any architecture (as long as the embedding layer is the last one of the model) and on any modality involving text.
From these 16 families, we generated over 𝟱,𝟱𝟬𝟬 𝗺𝗼𝗻𝗼𝗹𝗶𝗻𝗴𝘂𝗮𝗹 𝗺𝗼𝗱𝗲𝗹𝘀 𝗶𝗻 𝟭𝟮𝟰 𝗱𝗶𝗳𝗳𝗲𝗿𝗲𝗻𝘁 𝗹𝗮𝗻𝗴𝘂𝗮𝗴𝗲𝘀 🌍

Key takeaways from our experiments:
1️⃣ Trimming does not require a GPU. Our models were obtained on a CPU.
2️⃣ This method scales up to at least 4B parameters (we did not test beyond that).
3️⃣ Trimmed model is smaller than the original while preserving its performance. If you observe a slight performance drop, just fine-tuned to recover or even surpass the original performance.
4️⃣ For an equivalent compute budget, it is better to trim then fine-tune rather than fine-tuning the original model. Since the model is smaller, you can run more epochs/show more data and get in fine a better model than the original.
5️⃣ Trimming is a competitive alternative to distillation and quantization. E.g. we obtained our alternative to DistilBERT in 9 minutes on CPU vs. 90 hours of GPU for the latter.
6️⃣ Trimming could generate reasoning traces in the language of the trimmed model. This could be an alternative to generating traces in English and then translating them into the desired language.

And many other things (such as how much data are needed, the impact of the database used, the order in which it should be done, etc.) are available in the blogpost!

Blogpost: https://huggingface.co/blog/lbourdois/introduction-to-trimming
Models: alphaedge-ai/Trimming_models_search

View all Posts

Articles 5

Article

Introduction to Trimming ✂

View all Articles

Collections 18

View 18 collections

spaces 2

Free online AI courses in French

📚

French translations of five AI courses

SSM Blog Posts

📝

Blog posts about State Space Models (SSM)

models 0

None public yet

datasets 82

Loïck BOURDOIS

AI & ML interests

Recent Activity

Organizations

Posts 6

Articles 5

Introduction to Trimming ✂

Collections 18

French prompts datasets

French DPO and conversation datasets

French think and toolcalling datasets

French embedding datasets

Free online AI courses in French

lbourdois/en-fr-nyu-dl-course-corpus

SSM Blog Posts

Guide sur l'évaluation des LLM

French prompts datasets

French DPO and conversation datasets

French think and toolcalling datasets

French embedding datasets

Free online AI courses in French

lbourdois/en-fr-nyu-dl-course-corpus

SSM Blog Posts

Guide sur l'évaluation des LLM

spaces 2

Free online AI courses in French

SSM Blog Posts

models 0

datasets 82

lbourdois/smolLM3_french_data

lbourdois/fineweb-2-trimming

lbourdois/images

lbourdois/PDF_cours_NYU_DL

lbourdois/caption-wit_base_french

lbourdois/VQA-Marsouuu-central-banks-and-monetary-authorities-reports-clean

lbourdois/VQA-Marsouuu-french-kid-positive-only-clean

lbourdois/VQA-tabmwp

lbourdois/VQA-nlp-waseda-KnowRecall-clean

lbourdois/caption-vsr

Loïck BOURDOIS

AI & ML interests

Recent Activity

Organizations

Posts 6

Articles 5

Introduction to Trimming ✂

Collections 18

Free online AI courses in French

SSM Blog Posts

Guide sur l'évaluation des LLM

Free online AI courses in French

SSM Blog Posts

Guide sur l'évaluation des LLM

spaces 2 Sort: Recently updated

Free online AI courses in French

SSM Blog Posts

models 0

datasets 82 Sort: Recently updated

spaces 2

datasets 82