Marqo-Ecommerce-Embeddings Collection State-of-the-art embedding models fine-tuned for the ecommerce domain. +67% increase in evaluation metrics vs ViT-B-16-SigLIP. • 10 items • Updated 3 days ago • 14
NLI Eval Datasets Collection A curated collection of NLI evaluation datasets. Each dataset is exactly as originally proposed • 19 items • Updated 5 days ago • 2
BhasaAnuvaad Collection A Speech Translation Dataset for 13 Indian Languages • 11 items • Updated 3 days ago • 8
view article Article Releasing the largest multilingual open pretraining dataset By Pclanglais • 4 days ago • 88
view article Article Transformers.js v3: WebGPU support, new models & tasks, and more… 27 days ago • 60
DataEnvGym Collection Skills, datasets, etc for DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback • 6 items • Updated Oct 10 • 1
view article Article 🥐CroissantLLM: A Truly Bilingual French-English Language Model By manu • Feb 5 • 10
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 24 days ago • 467
view article Article 🌟 Easy Fine-Tuning with Hugging Face SQL Console, Notebook Creator, and SFT By asoria • Sep 24 • 12
view article Article Deep Learning over the Internet: Training Language Models Collaboratively Jul 15, 2021 • 4