view article Article The Transformers Library: standardizing model definitions By lysandre and 3 others • 14 days ago • 103
view article Article The New and Fresh analytics in Inference Endpoints By erikkaum and 4 others • Mar 21 • 21
view article Article Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥 By reach-vb and 6 others • Feb 18 • 99
view article Article How to deploy and fine-tune DeepSeek models on AWS By pagezyhf and 2 others • Jan 30 • 52
view article Article Yay! Organizations can now publish blog Articles By huggingface and 3 others • Jan 20 • 45
view article Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference By mfuntowicz and 1 other • Jan 16 • 74
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated 30 days ago • 365
view article Article The 5 Most Under-Rated Tools on Hugging Face By derek-thomas • Aug 22, 2024 • 89
view article Article Serverless Inference with Hugging Face and NVIDIA NIMs By philschmid and 1 other • Jul 29, 2024 • 31
view article Article SmolLM - blazingly fast and remarkably powerful By loubnabnl and 2 others • Jul 16, 2024 • 373
view article Article Google Cloud TPUs made available to Hugging Face users By pagezyhf and 3 others • Jul 9, 2024 • 19
view article Article Our Transformers Code Agent beats the GAIA benchmark! By m-ric and 1 other • Jul 1, 2024 • 88
view article Article Going multimodal: How Prezi is leveraging the Hub and the Expert Support Program to accelerate their ML roadmap By Violette and 3 others • Jun 19, 2024 • 11
view article Article Hugging Face on AMD Instinct MI300 GPU By mfuntowicz and 3 others • May 21, 2024 • 14
view article Article Deploy models on AWS Inferentia2 from Hugging Face By philschmid and 1 other • May 22, 2024 • 13