view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM By ariG23498 and 3 others • Mar 12 • 439
view article Article Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub By drbh and 6 others • 24 days ago • 105
view article Article From PyTorch DDP to 🤗 Accelerate to 🤗 Trainer, mastery of distributed training with ease By muellerzr • Oct 21, 2022 • 32
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper • 2506.05209 • Published 30 days ago • 42
view article Article How to train a new language model from scratch using Transformers and Tokenizers By julien-c • Feb 14, 2020 • 41
view article Article Efficient LLM Pretraining: Packed Sequences and Masked Attention By sirluk • Oct 7, 2024 • 43
OpenLLMTurkishLeadboard Datasets Collection This Collection contains a mix of benchmarks. used for evaluation in the openllm lead-board for Turkish LLMs • 6 items • Updated Apr 26, 2024 • 4
view article Article Open-source DeepResearch – Freeing our search agents By m-ric and 4 others • Feb 4 • 1.26k