view article Article 🤗 PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware Feb 10, 2023 • 69
phamvanlinh143/TinyLlama-1.1B-Chat-v1.0-GPTQ-4bit-gs128 Text Generation • Updated Aug 16, 2024 • 7
view article Article Overview of natively supported quantization schemes in 🤗 Transformers Sep 12, 2023 • 12
view article Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA May 24, 2023 • 133