view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 264
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights Paper • 2509.22944 • Published Sep 26, 2025 • 79
view article Article Classement compar:IA : des votes des utilisateurs au classement participatif des modèles Nov 3, 2025 • 6
view article Article Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training +3 Aug 8, 2025 • 90
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! +10 Aug 5, 2025 • 508
view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face +3 Jul 29, 2025 • 206
Flux quantized checkpoints Collection This collection regroups quantized flux checkpoints that we used in this blogpost: https://huggingface.co/blog/diffusers-quantization • 5 items • Updated Nov 26, 2025 • 2
view article Article Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs +7 Apr 29, 2025 • 43
view article Article 🔥 Announcing FLUX-Juiced: The Fastest Image Generation Endpoint (2.6 times faster)! Apr 23, 2025 • 12
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated Jul 10, 2025 • 212
view article Article Memory-efficient Diffusion Transformers with Quanto and Diffusers Jul 30, 2024 • 68