Deep Ignorance: Filtering Pretraining Data Builds Tamper-Resistant Safeguards into Open-Weight LLMs Paper • 2508.06601 • Published 7 days ago • 5
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published 7 days ago • 135
view article Article Introducing AI Sheets: a tool to work with datasets using open AI models! By dvilasuero and 5 others • 7 days ago • 51
Lessons from a Chimp: AI "Scheming" and the Quest for Ape Language Paper • 2507.03409 • Published Jul 4 • 1
Safer or Luckier? LLMs as Safety Evaluators Are Not Robust to Artifacts Paper • 2503.09347 • Published Mar 12 • 1
view article Article The GPT-OSS models are here… and they’re energy-efficient! By sasha • 8 days ago • 16
view article Article Why We Built the OpenMDW License: A Comprehensive License for ML Models By linuxfoundation • Jul 2 • 23
view changelog Changelog Introducing HF Jobs: Run scalable compute jobs on Hugging Face 16 days ago • 102
The Well Collection A 15TB collection of physics simulation datasets. • 18 items • Updated Mar 24 • 30
view article Article What Open-Source Developers Need to Know about the EU AI Act's Rules for GPAI Models By yjernite and 5 others • 11 days ago • 24
view article Article Introducing Command A Vision: Multimodal AI built for Business By CohereLabs and 3 others • 15 days ago • 62
view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face By abidlabs and 4 others • 17 days ago • 153
view article Article Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨ By Wauplin and 2 others • 21 days ago • 75
view changelog Changelog Inference Providers now fully support OpenAI-compatible API 28 days ago • 77
SmolLM3 evaluation datasets Collection Datasets to decontaminate the post-training mixtures against. Use the subset and column values described per entry • 13 items • Updated Jul 8 • 5