OpenEuroLLM

community

https://openeurollm.eu/

OpenEuroLLM

openeurollm

Activity Feed Request to join this org

AI & ML interests

Open, Multilingual, European, Generative, Foundational LLM

Recent Activity

geoalgo updated a dataset 5 days ago

openeurollm/nemotron-cc-10K-sample-translated

geoalgo published a dataset 8 days ago

openeurollm/nemotron-cc-10K-sample-translated

Villekom authored a paper 15 days ago

Got Compute, but No Data: Lessons From Post-training a Finnish LLM

View all activity

geoalgo

updated a dataset 5 days ago

openeurollm/nemotron-cc-10K-sample-translated

Viewer • Updated 8 days ago • 450k • 63

geoalgo

published a dataset 8 days ago

openeurollm/nemotron-cc-10K-sample-translated

Viewer • Updated 8 days ago • 450k • 63

Villekom

authored 2 papers 15 days ago

Got Compute, but No Data: Lessons From Post-training a Finnish LLM

Paper • 2503.09407 • Published Mar 12 • 1

An Expanded Massive Multilingual Dataset for High-Performance Language Technologies

Paper • 2503.10267 • Published Mar 13 • 1

geoalgo

authored a paper about 1 month ago

ARLBench: Flexible and Efficient Benchmarking for Hyperparameter Optimization in Reinforcement Learning

Paper • 2409.18827 • Published Sep 27, 2024

tiedeman

authored a paper about 1 month ago

Massively Multilingual Adaptation of Large Language Models Using Bilingual Translation Data

Paper • 2506.00469 • Published May 31 • 2

tiedeman

authored a paper 4 months ago

An Expanded Massive Multilingual Dataset for High-Performance Language Technologies

Paper • 2503.10267 • Published Mar 13 • 1

mbanon

authored a paper 4 months ago

An Expanded Massive Multilingual Dataset for High-Performance Language Technologies

Paper • 2503.10267 • Published Mar 13 • 1

gramirez-prompsit

updated a Space 4 months ago

README

🌍

gramirez-prompsit

published a Space 4 months ago

README

🌍

flxst

authored 2 papers 5 months ago

GPT-SW3: An Autoregressive Language Model for the Nordic Languages

Paper • 2305.12987 • Published May 22, 2023

Better Embeddings with Coupled Adam

Paper • 2502.08441 • Published Feb 12 • 1

mbanon

authored 2 papers 6 months ago

A New Massive Multilingual Dataset for High-Performance Language Technologies

Paper • 2403.14009 • Published Mar 20, 2024 • 1

FastSpell: the LangId Magic Spell

Paper • 2404.08345 • Published Apr 12, 2024

tiedeman

authored 6 papers 9 months ago

Uncertainty-Aware Natural Language Inference with Stochastic Weight Averaging

Paper • 2304.04726 • Published Apr 10, 2023

Sentence Embeddings in NLI with Iterative Refinement Encoders

Paper • 1808.08762 • Published Aug 27, 2018

Domain-specific Continued Pretraining of Language Models for Capturing Long Context in Mental Health

Paper • 2304.10447 • Published Apr 20, 2023 • 1

The University of Helsinki submissions to the WMT19 news translation task

Paper • 1906.04040 • Published Jun 10, 2019

Predicting Prosodic Prominence from Text with Pre-trained Contextualized Word Representations

Paper • 1908.02262 • Published Aug 6, 2019

XED: A Multilingual Dataset for Sentiment Analysis and Emotion Detection

Paper • 2011.01612 • Published Nov 3, 2020 • 1

AI & ML interests

Recent Activity

Team members 17

openeurollm's activity

README

README