Apertus LLM • Collection • Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data, open-weights models, multilingual in >1000 languages • 4 items
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection • Paper 2403.03507 • Published Mar 6, 2024
Efficient Language Adaptive Pre-training: Extending State-of-the-Art Large Language Models for Polish • Paper 2402.09759 • Published Feb 15, 2024
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts • Paper 2401.04081 • Published Jan 8, 2024