Lucas Beyer

giffmana

http://lucasb.eyer.be

AI & ML interests

None yet

Recent Activity

commented on a paper 24 days ago

PaLI: A Jointly-Scaled Multilingual Language-Image Model

authored a paper 3 months ago

Gemma 3 Technical Report

new activity 4 months ago

google/siglip-so400m-patch14-384:Is SiglipImageProcessor configured correctly?

authored a paper 3 months ago

Gemma 3 Technical Report

Paper • 2503.19786 • Published Mar 25 • 52

authored a paper 4 months ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 146

authored a paper 7 months ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 134

authored a paper 12 months ago

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10, 2024 • 71

authored 16 papers more than 1 year ago

A Large-scale Study of Representation Learning with the Visual Task Adaptation Benchmark

Paper • 1910.04867 • Published Oct 1, 2019

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

Paper • 2010.11929 • Published Oct 22, 2020 • 10

Big Transfer (BiT): General Visual Representation Learning

Paper • 1912.11370 • Published Dec 24, 2019 • 1

FlexiViT: One Model for All Patch Sizes

Paper • 2212.08013 • Published Dec 15, 2022 • 1

MLP-Mixer: An all-MLP Architecture for Vision

Paper • 2105.01601 • Published May 4, 2021

Knowledge distillation: A good teacher is patient and consistent

Paper • 2106.05237 • Published Jun 9, 2021

Image Captioners Are Scalable Vision Learners Too

Paper • 2306.07915 • Published Jun 13, 2023 • 11

Scaling Vision Transformers to 22 Billion Parameters

Paper • 2302.05442 • Published Feb 10, 2023 • 2

Tuning computer vision models with task rewards

Paper • 2302.08242 • Published Feb 16, 2023

Sigmoid Loss for Language Image Pre-Training

Paper • 2303.15343 • Published Mar 27, 2023 • 8

The Efficiency Misnomer

Paper • 2110.12894 • Published Oct 25, 2021

Kubric: A scalable dataset generator

Paper • 2203.03570 • Published Mar 7, 2022 • 1

PaLI: A Jointly-Scaled Multilingual Language-Image Model

Paper • 2209.06794 • Published Sep 14, 2022 • 2

Revisiting Self-Supervised Visual Representation Learning

Paper • 1901.09005 • Published Jan 25, 2019

PaLI-3 Vision Language Models: Smaller, Faster, Stronger

Paper • 2310.09199 • Published Oct 13, 2023 • 29

Better plain ViT baselines for ImageNet-1k

Paper • 2205.01580 • Published May 3, 2022