Gemma 3 QAT Collection Quantization-Aware Trained (QAT) Gemma 3 checkpoints. The models preserve quality similar to half precision while using 3x less memory; a rough memory arithmetic sketch follows this list. • 15 items • Updated 30 days ago • 192
Granite Code Models Collection A series of code models trained by IBM and released under the Apache 2.0 license. We release both the base pretrained and instruct models. • 23 items • Updated 16 days ago • 191
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases. • 5 items • Updated Dec 6, 2024 • 765
RAFT: Adapting Language Model to Domain Specific RAG Paper • 2403.10131 • Published Mar 15, 2024 • 73
BitNet: Scaling 1-bit Transformers for Large Language Models Paper • 2310.11453 • Published Oct 17, 2023 • 102
Mamba: Linear-Time Sequence Modeling with Selective State Spaces Paper • 2312.00752 • Published Dec 1, 2023 • 143
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling Paper • 2311.00430 • Published Nov 1, 2023 • 58
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis Paper • 2307.01952 • Published Jul 4, 2023 • 86
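As a rough illustration of the memory claim in the Gemma 3 QAT entry above, the sketch below compares the weight footprint of a half-precision (bfloat16) checkpoint against a 4-bit quantized one. The parameter count and the per-weight quantization overhead are illustrative assumptions, not figures taken from the collection itself.

```python
# Back-of-the-envelope comparison of weight memory: bfloat16 vs. 4-bit QAT weights.
# The 27B parameter count and the scale/zero-point overhead are assumptions
# chosen for illustration, not official numbers for any specific checkpoint.

PARAMS = 27e9                 # assumed parameter count
BF16_BYTES_PER_PARAM = 2.0    # bfloat16 stores 2 bytes per weight
INT4_BYTES_PER_PARAM = 0.5    # 4-bit weights store 0.5 bytes per weight
INT4_OVERHEAD = 0.0625        # assumed extra bytes/param for quantization scales

bf16_gb = PARAMS * BF16_BYTES_PER_PARAM / 1e9
int4_gb = PARAMS * (INT4_BYTES_PER_PARAM + INT4_OVERHEAD) / 1e9

print(f"bf16 weights: {bf16_gb:.1f} GB")   # ~54 GB under these assumptions
print(f"int4 weights: {int4_gb:.1f} GB")   # ~15 GB under these assumptions
print(f"reduction:    {bf16_gb / int4_gb:.1f}x")
```

Under these assumed numbers the weight footprint shrinks by roughly a factor of three to four, which is consistent with the "3x less memory" description once real-world overhead (embeddings, KV cache, runtime buffers) is accounted for.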