view article Article Accelerating LLM Code Generation Through Mask Store Streamlining By vivien β’ Jan 17 β’ 2
view article Article KV Cache from scratch in nanoVLM By ariG23498 and 4 others β’ 23 days ago β’ 78
Utilities Collection No crazy stuff, but useful ones for in-between steps β’ 16 items β’ Updated Mar 19 β’ 7
π¦π Useful Tiny Video Converters Collection All spaces made to convert a video (of GIFs) to anything useful in your pipelines β’ 5 items β’ Updated Oct 3, 2024 β’ 7
view article Article Interactive Tools for machine learning, deep learning, and math By Suzana β’ May 26 β’ 44
view article Article Exploring Quantization Backends in Diffusers By derekl35 and 2 others β’ May 21 β’ 37
view article Article CodeAgents + Structure: A Better Way to Execute Actions By akseljoonas and 1 other β’ about 1 month ago β’ 60
D-FINE Collection State-of-the-art real-time object detection model with Apache 2.0 licence β’ 15 items β’ Updated May 5 β’ 55
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory β’ 15 items β’ Updated 28 days ago β’ 201
Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory Paper β’ 2504.19413 β’ Published Apr 28 β’ 17
Vidi: Large Multimodal Models for Video Understanding and Editing Paper β’ 2504.15681 β’ Published Apr 22 β’ 15
Excellent SLM & SVLM Collection Excellent SLM (small language models) and SVLM (small vison language models). β’ 29 items β’ Updated Apr 1 β’ 4
Gemini Embedding: Generalizable Embeddings from Gemini Paper β’ 2503.07891 β’ Published Mar 10 β’ 39