SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper โข 2502.02737 โข Published 17 days ago โข 188
view article Article From PyTorch DDP to ๐ค Accelerate to ๐ค Trainer, mastery of distributed training with ease Oct 21, 2022 โข 23
Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders Paper โข 2410.22366 โข Published Oct 28, 2024 โข 78
Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order Paper โข 2404.00399 โข Published Mar 30, 2024 โข 42