Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
3591
29
12
Loïck BOURDOIS
lbourdois
Follow
ntnq's profile picture
Kaniass's profile picture
velaia's profile picture
105 followers
·
82 following
https://lbourdois.github.io/blog/
BdsLoick
lbourdois
lbourdois
lbourdois.bsky.social
AI & ML interests
👀
Recent Activity
posted
an
update
5 days ago
We introduce FAT5 (Flash Attention T5) ⚡ An implementation of T5 in PyTorch with UL2 objective optimized for GPGPU for both training and inference thanks to 13 different optimizations. The main one is that we have designed a CUDA kernel to expand the Flash Attention by @tridao with RPE biases and supports other PE such as RoPE, ALiBi or FIRE. The result kernel is 2 times faster than a SPDA implementation. We also use Triton kernels to optimize certain parts of the architecture, such as the cross-entropy and RMSNorm layer. The various kernels have been carefully built to be compatible with BF16 and torch.compile to go even faster and achieve efficient pretraining. All other optimizations are described in a 📝 subsequent blog post available on @huggingface 🤗: https://huggingface.co/spaces/CATIE-AQ/FAT5-report. This methodology enabled us to efficiently pretrain as a proof of concept a FAT5 with 147M parameters in French in a reasonable time (1,461H for 419B tokens), with limited resources (1 A100 i.e. a computational budget of ~ €1,900) and a low carbon footprint (13.5kg eq CO2). The model's weights are also available on Hugging Face: https://huggingface.co/CATIE-AQ/FAT5-small. Not very useful in practice, it's a PoC and not an instructed model (it's planned for later). All the code is available on GitHub if you want to pretrain your own model in your own language or for a specific domain: https://github.com/catie-aq/flashT5 ⭐ Ending by indicating that was a joint project with @BorisAlbar at hf.co/CATIE-AQ.
updated
a dataset
10 days ago
Bretagne/Lingua_Libre
updated
a dataset
10 days ago
Bretagne/Banque_Sonore_Dialectes_Bretons
View all activity
Organizations
lbourdois
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
published
an
article
8 months ago
view article
Article
Introduction to State Space Models (SSM)
By
lbourdois
•
Jul 19, 2024
•
115
published
an
article
12 months ago
view article
Article
History of State Space Models (SSM) in 2022
By
lbourdois
•
Apr 11, 2024
•
18
published
an
article
about 1 year ago
view article
Article
Introduction to LLE
By
lbourdois
•
Feb 1, 2024
•
3