view article Article Introducing Training Cluster as a Service - a new collaboration with NVIDIA By jeffboudier and 2 others β’ 6 days ago β’ 20
view article Article Microsoft and Hugging Face expand collaboration By jeffboudier and 2 others β’ 29 days ago β’ 22
view article Article The Transformers Library: standardizing model definitions By lysandre and 3 others β’ May 15 β’ 113
view article Article Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models. By tiiuae and 9 others β’ May 15 β’ 35
view article Article Reduce, Reuse, Recycle: Why Open Source is a Win for Sustainability By sasha and 1 other β’ May 7 β’ 14
view article Article Introducing AutoRound: Intelβs Advanced Quantization for LLMs and VLMs By wenhuach and 8 others β’ Apr 29 β’ 33
view article Article Cohere on Hugging Face Inference Providers π₯ By burtenshaw and 6 others β’ Apr 16 β’ 126
view article Article Comparing sub 50GB Llama 4 Scout quants (KLD/Top P) By bartowski β’ Apr 9 β’ 40
view article Article π¦Έπ»#14: What Is MCP, and Why Is Everyone β Suddenly!β Talking About It? By Kseniase β’ Mar 17 β’ 290
Kimi-VL-A3B Collection Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking β’ 6 items β’ Updated Apr 12 β’ 65
view article Article CPU Optimized Embeddings with π€ Optimum Intel and fastRAG By peterizsak and 5 others β’ Mar 15, 2024 β’ 10
Adaptive Orchestration for Large-Scale Inference on Heterogeneous Accelerator Systems Balancing Cost, Performance, and Resilience Paper β’ 2503.20074 β’ Published Mar 25 β’ 6
view article Article NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets By mingyuliutw and 4 others β’ Mar 18 β’ 41
view article Article Introducing Llama Vision-Instruct Models with DigitalOcean 1-Click GPU Droplets By JamesDigitalOcean β’ Mar 14 β’ 4
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM By ariG23498 and 3 others β’ Mar 12 β’ 434