GaLore: Advancing Large Model Training on Consumer-grade Hardware By Titus-von-Koeller and 8 others • Mar 20, 2024 • 29
🤗 PEFT welcomes new merging methods By smangrul and 1 other • Feb 19, 2024 • 20
Mixture of Experts Explained By osanseviero and 5 others • Dec 11, 2023 • 714
Personal Copilot: Train Your Own Coding Assistant By smangrul and 1 other • Oct 27, 2023 • 64
Fine-tuning Llama 2 70B using PyTorch FSDP By smangrul and 3 others • Sep 13, 2023 • 25
The Falcon has landed in the Hugging Face ecosystem By lvwerra and 7 others • Jun 5, 2023 • 14
Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA By ybelkada and 4 others • May 24, 2023 • 156
Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU By edbeeching and 5 others • Mar 9, 2023 • 57
🤗 PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware By smangrul and 1 other • Feb 10, 2023 • 86
Accelerate Large Model Training using DeepSpeed By smangrul and 1 other • Jun 28, 2022 • 6
Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel By smangrul and 1 other • May 2, 2022 • 4