MultiSlav • Collection • Multilingual Machine Translation Open-Source Slavic models • 19 items • Updated 3 days ago • 8
Embedding Model Datasets • Collection • A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 68 items • Updated 13 days ago • 111
Aya Datasets • Collection • The Aya Collection is a massive multilingual collection for over 100 languages consisting of 513 million instances of prompts and completions. • 5 items • Updated 7 days ago • 16
Polish Language Models • Collection • Collection of pre-trained and fine-tuned Polish Language Models • 14 items • Updated Oct 17, 2024 • 1
Polish Question Answering • Collection • Collection of models and datasets for Polish Question Answering. • 18 items • Updated Oct 17, 2024 • 4
SilverRetriever: Advancing Neural Passage Retrieval for Polish Question Answering • Paper • 2309.08469 • Published Sep 15, 2023 • 3
HerBERT: Efficiently Pretrained Transformer-based Language Model for Polish • Paper • 2105.01735 • Published May 4, 2021 • 1
Evaluation of Transfer Learning for Polish with a Text-to-Text Model • Paper • 2205.08808 • Published May 18, 2022 • 1
Going beyond research datasets: Novel intent discovery in the industry setting • Paper • 2305.05474 • Published May 9, 2023 • 1
KLEJ: Comprehensive Benchmark for Polish Language Understanding • Paper • 2005.00630 • Published May 1, 2020 • 1
Semi-Supervised Neural System for Tagging, Parsing and Lematization • Paper • 2004.12450 • Published Apr 26, 2020 • 1
MAUPQA: Massive Automatically-created Polish Question Answering Dataset • Paper • 2305.05486 • Published May 9, 2023 • 1
Improving Question Answering Performance through Manual Annotation: Costs, Benefits and Strategies • Paper • 2212.08897 • Published Dec 17, 2022 • 2