view article Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • 2 days ago • 413
view article Article cocogold: training Marigold for text-grounded segmentation By pcuenq • 1 day ago • 21
view article Article Gemma 3n fully available in the open-source ecosystem! By ariG23498 and 7 others • 14 days ago • 105
Quartet: Native FP4 Training Can Be Optimal for Large Language Models Paper • 2505.14669 • Published May 20 • 76
Gemma 3 Collection A collection of lightweight, state-of-the-art open models built from the same research and technology that powers the Gemini 2.0 models • 32 items • Updated May 14 • 28
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM By ariG23498 and 3 others • Mar 12 • 443
Reducing Transformer Key-Value Cache Size with Cross-Layer Attention Paper • 2405.12981 • Published May 21, 2024 • 34
Core ML Text Generation Collection [WIP] On-device LLMs https://huggingface.co/blog/swift-coreml-llm • 3 items • Updated Sep 7, 2023 • 4
view article Article Halo: Open Source Health Tracking with Wearables By cyrilzakka • Nov 19, 2024 • 112
view article Article WWDC 24: Running Mistral 7B with Core ML By FL33TW00D-HF and 3 others • Jul 22, 2024 • 61