Text Annotation Handbook: A Practical Guide for Machine Learning Projects Paper • 2310.11780 • Published Oct 18, 2023
SWEb: A Large Web Dataset for the Scandinavian Languages Paper • 2410.04456 • Published Oct 6, 2024 • 1
The Impact of Copyrighted Material on Large Language Models: A Norwegian Perspective Paper • 2412.09460 • Published Dec 12, 2024 • 9
Whispering in Norwegian: Navigating Orthographic and Dialectic Challenges Paper • 2402.01917 • Published Feb 2, 2024
Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks Paper • 2406.13469 • Published Jun 19, 2024
Addressing contingency in algorithmic (mis)information classification: Toward a responsible machine learning agenda Paper • 2210.09014 • Published Oct 5, 2022
Monitoring Model Deterioration with Explainable Uncertainty Estimation via Non-parametric Bootstrap Paper • 2201.11676 • Published Jan 27, 2022 • 1
MuMiN: A Large-Scale Multilingual Multimodal Fact-Checked Misinformation Social Network Dataset Paper • 2202.11684 • Published Feb 23, 2022
ScandEval: A Benchmark for Scandinavian Natural Language Processing Paper • 2304.00906 • Published Apr 3, 2023 • 4
HaT5: Hate Language Identification using Text-to-Text Transfer Transformer Paper • 2202.05690 • Published Feb 11, 2022
Småprat: DialoGPT for Natural Language Generation of Swedish Dialogue by Transfer Learning Paper • 2110.06273 • Published Oct 12, 2021