AI & ML interests

Developing foundation models for low-resource languages.

Recent Activity

nicholasKluge  updated a Space about 1 month ago
Polygl0t/README
nicholasKluge  updated a collection about 1 month ago
ViTucano-v1 (Portuguese)
nicholasKluge  updated a collection about 1 month ago
ViTucano-v1 (Portuguese)
View all activity

Polyglot is an initiative to close the linguistic divide in NLP by developing efficient and accessible foundation models for low-resource languages.

While recent breakthroughs in generative AI have been driven by large-scale foundation models, these advances have largely benefited high-resource languages, leaving many underrepresented languages behind. The current deep learning paradigm—heavily reliant on massive datasets and computing power—has unintentionally widened this gap, making it harder for speakers of low-resource languages to access and shape AI technologies that reflect their linguistic and cultural identities.

Polyglot addresses this imbalance by creating tools, models, and datasets that support open, sustainable, and inclusive AI development. We aim to empower researchers and communities working with low-resource languages through high-quality open-source resources, enabling them to build and fine-tune language models tailored to their needs.

Recent Publications 📚

News 🚀

Community Contributions 🤝

Polyglot is a project funded by the Federal Ministry of Education and Research (BMBF) and the Ministry of Culture and Science of the State of North Rhine-Westphalia (MWK) as part of TRA Sustainable Futures (University of Bonn) and the Excellence Strategy of the federal and state governments.

models 0

None public yet

datasets 0

None public yet