view article Article Introducing ColQwen-Omni: Retrieve in every modality By manu and 4 others • 18 days ago • 60
view article Article KV Caching Explained: Optimizing Transformer Inference Efficiency By not-lain • Jan 30 • 107
view article Article FastRTC: The Real-Time Communication Library for Python By freddyaboulton and 1 other • Feb 25 • 172
view article Article Fine-Tune Whisper with 🤗 Transformers By sanchit-gandhi • Nov 3, 2022 • 269