view article Article How Much Power does a SOTA Open Video Model Use? ⚡🎥 By jdelavande and 2 others • 3 days ago • 8
view article Article Teaching Data Literacy with Hugging Face's AI Sheets By ParulPandey • 6 days ago • 23
view article Article Open Source AI: A Cornerstone of Digital Sovereignty By frimelle and 1 other • 24 days ago • 16
view article Article MCP is at a Tipping Point: Here's Why You Should Care By fdaudens • 25 days ago • 16
BehaviorBox: Automated Discovery of Fine-Grained Performance Differences Between Language Models Paper • 2506.02204 • Published Jun 2 • 1
view article Article ScreenSuite - The most comprehensive evaluation suite for GUI Agents! about 1 month ago • 49
Common Pile v0.1 Collection All resources related to Common Pile v0.1, an 8TB dataset of public domain and openly licensed text • 4 items • Updated 30 days ago • 26
Reward Bench 2 Collection Datasets, spaces, and models for Reward Bench 2 benchmark and paper! • 11 items • Updated Jun 3 • 12
view article Article *Context Is Gold to Find the Gold Passage*: Evaluating and Training Contextual Document Embeddings By manu and 1 other • Jun 2 • 24
view article Article AI Policy @🤗: Response to the 2025 National AI R&D Strategic Plan By evijit and 2 others • Jun 2 • 13
view article Article CodeAgents + Structure: A Better Way to Execute Actions By akseljoonas and 1 other • May 28 • 63