GLiClass: Generalist Lightweight Model for Sequence Classification Tasks Paper • 2508.07662 • Published 5 days ago • 8
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated 9 days ago • 283
German BabyLM Collection Data that can be used for developing developmentally plausible language models in German. • 13 items • Updated May 28 • 2
Teuken-7B-v0.6 Collection OpenGPT-X Teuken 7B models trained on 6 trillion tokens. • 2 items • Updated 19 days ago • 4
view article Article Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨ By Wauplin and 2 others • 22 days ago • 75
GLiNER2: An Efficient Multi-Task Information Extraction System with Schema-Driven Interface Paper • 2507.18546 • Published 22 days ago • 18
Effective Multi-Task Learning for Biomedical Named Entity Recognition Paper • 2507.18542 • Published 22 days ago • 1
Checklists Are Better Than Reward Models For Aligning Language Models Paper • 2507.18624 • Published 22 days ago • 2
Exploring Gender Bias in Large Language Models: An In-depth Dive into the German Language Paper • 2507.16557 • Published 25 days ago • 2
GG-BBQ: German Gender Bias Benchmark for Question Answering Paper • 2507.16410 • Published 25 days ago • 2
Enhancing Multilingual LLM Pretraining with Model-Based Data Selection Paper • 2502.10361 • Published Feb 14 • 1
view article Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • Jul 8 • 625
view article Article What is the Hugging Face Community Building? By evijit and 2 others • Jul 15 • 12
TUMLU: A Unified and Native Language Understanding Benchmark for Turkic Languages Paper • 2502.11020 • Published Feb 16 • 8
Different Tastes of Entities: Investigating Human Label Variation in Named Entity Annotations Paper • 2402.01423 • Published Feb 2, 2024 • 1
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities Paper • 2507.06261 • Published Jul 7 • 59