view article Article Vision Language Models (Better, faster, stronger) +3 merve, sergiopaniego, ariG23498, pcuenq, andito β’ May 12, 2025 β’ 612
view article Article Fine-Tune Whisper For Multilingual ASR with π€ Transformers sanchit-gandhi β’ Nov 3, 2022 β’ 371
view article Article SmolVLM - small yet mighty Vision Language Model +3 andito, merve, mfarre, eliebak, pcuenq β’ Nov 26, 2024 β’ 418
view article Article Assisted Generation: a new direction toward low-latency text generation joaogante β’ May 11, 2023 β’ 79
view article Article Introduction to Quantization cooked in π€ with ππ§βπ³ merve β’ Aug 25, 2023 β’ 39
view article Article Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth mlabonne β’ Jul 29, 2024 β’ 371
view article Article Mixture of Experts Explained +4 osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq β’ Dec 11, 2023 β’ 1.13k
view article Article π³οΈ Attention Sinks in LLMs for endless fluency tomaarsen β’ Oct 9, 2023 β’ 37
view article Article Constitutional AI with Open LLMs +5 vwxyzjn, lewtun, edbeeching, lvwerra, osanseviero, kashif, thomwolf β’ Feb 1, 2024 β’ 17
view article Article Preference Tuning LLMs with Direct Preference Optimization Methods +3 kashif, edbeeching, lewtun, lvwerra, osanseviero β’ Jan 18, 2024 β’ 83
view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) +2 natolambert, LouisCastricato, lvwerra, Dahoas β’ Dec 9, 2022 β’ 413
view article Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA +3 ybelkada, timdettmers, artidoro, sgugger, smangrul β’ May 24, 2023 β’ 180