Stefan Schweter's picture

Stefan Schweter PRO

stefan-it

·

AI & ML interests

Flair Library 💕, NER & PoS Tagging, LM Pretraining (mostly encoder-only & encoder-decoder), Historical Language Models, German Language Models, Bavarian NLP

Recent Activity

liked a model 1 day ago

google/gemma-3-270m

liked a dataset 2 days ago

BramVanroy/CommonCrawl-CreativeCommons-fine

new activity 4 days ago

flair/ner-french:docs: fix import and usage of French NER dataset

View all activity

Organizations

liked a model 1 day ago

google/gemma-3-270m

Text Generation • 0.3B • Updated 2 days ago • 2.92k • 242

liked a dataset 2 days ago

BramVanroy/CommonCrawl-CreativeCommons-fine

Viewer • Updated 2 days ago • 71.5M • 56 • 1

New activity in flair/ner-french 4 days ago

docs: fix import and usage of French NER dataset

#2 opened 4 days ago by

upvoted a paper 4 days ago

GLiClass: Generalist Lightweight Model for Sequence Classification Tasks

Paper • 2508.07662 • Published 5 days ago • 8

commented a paper 4 days ago

GLiClass: Generalist Lightweight Model for Sequence Classification Tasks

Paper • 2508.07662 • Published 5 days ago • 8 •

published a dataset 6 days ago

bavarian-nlp/barwiki-dumps

Viewer • Updated 6 days ago • 12 • 63

updated a dataset 6 days ago

bavarian-nlp/barwiki-dumps

Viewer • Updated 6 days ago • 12 • 63

published a dataset 6 days ago

bavarian-nlp/barwiki-20250801

Viewer • Updated 6 days ago • 43.9k • 52

updated a dataset 6 days ago

bavarian-nlp/barwiki-20250801

Viewer • Updated 6 days ago • 43.9k • 52

updated a dataset 7 days ago

bavarian-nlp/bavarian-books

Viewer • Updated 7 days ago • 35 • 78

updated a Space 9 days ago

README

published a dataset 9 days ago

bavarian-nlp/bavarian-books

Viewer • Updated 7 days ago • 35 • 78

upvoted a collection 10 days ago

gpt-oss

Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated 9 days ago • 283

commented a paper 19 days ago

Do Construction Distributions Shape Formal Language Learning In German BabyLMs?

Paper • 2503.11593 • Published Mar 14 • 1 •

upvoted 2 collections 19 days ago

German BabyLM

Data that can be used for developing developmentally plausible language models in German. • 13 items • Updated May 28 • 2

Teuken-7B-v0.6

OpenGPT-X Teuken 7B models trained on 6 trillion tokens. • 2 items • Updated 19 days ago • 4

liked a model 19 days ago

openGPT-X/Teuken-7B-base-v0.6

Text Generation • 7B • Updated 19 days ago • 325 • 6

upvoted a collection 19 days ago

LLäMmlein2Vec 🐑

4 items • Updated 19 days ago • 1

upvoted an article 21 days ago

Article

Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨

By

and 2 others •

22 days ago

• 75

upvoted a paper 22 days ago

GLiNER2: An Efficient Multi-Task Information Extraction System with Schema-Driven Interface

Paper • 2507.18546 • Published 22 days ago • 18