Piotr Rybak's picture

Piotr Rybak

piotr-rybak

·

360er0

AI & ML interests

Polish NLP

Recent Activity

liked a model 5 days ago

Qwen/Qwen3-Embedding-0.6B-GGUF

liked a dataset 18 days ago

ministere-culture/comparia-conversations

liked a model 27 days ago

stepfun-ai/Step1X-3D

View all activity

Organizations

piotr-rybak's activity

upvoted a collection 4 months ago

MultiSlav

Multilingual Machine Translation Open-Source Slavic models • 19 items • Updated Mar 7 • 8

upvoted a collection 12 months ago

Embedding Model Datasets

A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 70 items • Updated Apr 7 • 128

upvoted a collection about 1 year ago

Aya Datasets

The Aya Collection is a massive multilingual collection for over 100 languages consisting of 513 million instances of prompts and completions. • 5 items • Updated Mar 2 • 20

upvoted 2 collections over 1 year ago

Polish Language Models

Collection of pre-trained and fine-tuned Polish Language Models • 19 items • Updated 29 days ago • 1

Polish Question Answering

Collection of models and datasets for Polish Question Answering. • 18 items • Updated Oct 17, 2024 • 4

upvoted a paper over 1 year ago

SilverRetriever: Advancing Neural Passage Retrieval for Polish Question Answering

Paper • 2309.08469 • Published Sep 15, 2023 • 3

upvoted 7 papers almost 2 years ago

HerBERT: Efficiently Pretrained Transformer-based Language Model for Polish

Paper • 2105.01735 • Published May 4, 2021 • 1

Evaluation of Transfer Learning for Polish with a Text-to-Text Model

Paper • 2205.08808 • Published May 18, 2022 • 1

Going beyond research datasets: Novel intent discovery in the industry setting

Paper • 2305.05474 • Published May 9, 2023 • 1

KLEJ: Comprehensive Benchmark for Polish Language Understanding

Paper • 2005.00630 • Published May 1, 2020 • 1

Semi-Supervised Neural System for Tagging, Parsing and Lematization

Paper • 2004.12450 • Published Apr 26, 2020 • 1

MAUPQA: Massive Automatically-created Polish Question Answering Dataset

Paper • 2305.05486 • Published May 9, 2023 • 1

Improving Question Answering Performance through Manual Annotation: Costs, Benefits and Strategies

Paper • 2212.08897 • Published Dec 17, 2022 • 2