Matthias Minderer's picture

35 1

Matthias Minderer

mjlm

·

https://matthias.minderer.net

mjlm3
mjlm

AI & ML interests

Vision and language models

Organizations

authored a paper 7 months ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 134

authored a paper 12 months ago

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10, 2024 • 72

authored a paper over 1 year ago

Improving fine-grained understanding in image-text pre-training

Paper • 2401.09865 • Published Jan 18, 2024 • 18

authored a paper almost 2 years ago

Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution

Paper • 2307.06304 • Published Jul 12, 2023 • 30

authored a paper about 2 years ago

Scaling Open-Vocabulary Object Detection

Paper • 2306.09683 • Published Jun 16, 2023 • 13