Quentin Lhoest PRO

lhoestq

AI & ML interests

Maintainer of 🤗Datasets: NLP, Multimodal data processing and sharing

Articles

Organizations

lhoestq's activity

upvoted an article 4 days ago
view article
Article

Releasing the largest multilingual open pretraining dataset

88
upvoted an article 26 days ago
view article
Article

Transformers.js v3: WebGPU support, new models & tasks, and more…

60
upvoted an article about 1 month ago
upvoted 3 articles about 1 month ago
view article
Article

Scaling AI-based Data Processing with Hugging Face + Dask

23
view article
Article

Improving Parquet Dedupe on Hugging Face Hub

29
view article
Article

🥐CroissantLLM: A Truly Bilingual French-English Language Model

By manu
10
upvoted an article about 2 months ago
view article
Article

FineVideo: behind the scenes

23
upvoted an article about 2 months ago
view article
Article

🌟 Easy Fine-Tuning with Hugging Face SQL Console, Notebook Creator, and SFT

By asoria
12
upvoted an article 2 months ago
view article
Article

Introducing the SQL Console on Datasets

18
upvoted 2 articles 3 months ago
view article
Article

Scaling robotics datasets with video encoding

34
view article
Article

Deep Learning over the Internet: Training Language Models Collaboratively

4