Team-PIXEL

university

https://github.com/xplip/pixel

Activity Feed Request to join this org

AI & ML interests

Language modelling with pixels

Recent Activity

lyan62 authored a paper 15 days ago

Lost in Embeddings: Information Loss in Vision-Language Models

jflotz published a dataset 4 months ago

Team-PIXEL/rendered-bookcorpus-bigrams

jflotz published a dataset 4 months ago

Team-PIXEL/rendered-wiki_en-bigrams

View all activity

lyan62

authored a paper 15 days ago

Lost in Embeddings: Information Loss in Vision-Language Models

Paper • 2509.11986 • Published 16 days ago • 26

jflotz

published 2 datasets 4 months ago

Team-PIXEL/rendered-bookcorpus-bigrams

Viewer • Updated Apr 13, 2023 • 7.7M • 175

Team-PIXEL/rendered-wiki_en-bigrams

Viewer • Updated Apr 14, 2023 • 13.4M • 365

lyan62

authored a paper 4 months ago

Hanfu-Bench: A Multimodal Benchmark on Cross-Temporal Cultural Understanding and Transcreation

Paper • 2506.01565 • Published Jun 2 • 3

jflotz

published a model 4 months ago

Team-PIXEL/pixel-m4

Updated Dec 16, 2023 • 86

jflotz

authored 3 papers 6 months ago

plip

updated a Space 7 months ago

PIXEL

🐱

Generate text-masked images using PIXEL model

elliottd

authored a paper 7 months ago

Can Community Notes Replace Professional Fact-Checkers?

Paper • 2502.14132 • Published Feb 19 • 6

e-bug

authored a paper 10 months ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 133

lyan62

authored 3 papers about 1 year ago

FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese Food Culture

Paper • 2406.11030 • Published Jun 16, 2024

Understanding Retrieval Robustness for Retrieval-Augmented Image Captioning

Paper • 2406.02265 • Published Jun 4, 2024 • 7

The Role of Data Curation in Image Captioning

Paper • 2305.03610 • Published May 5, 2023

e-bug

authored a paper about 1 year ago

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10, 2024 • 72

e-bug

authored a paper over 1 year ago

Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings

Paper • 2404.16820 • Published Apr 25, 2024 • 17

ilkerkesen

authored a paper over 1 year ago

ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models

Paper • 2311.07022 • Published Nov 13, 2023 • 1

jflotz

updated 3 datasets over 1 year ago

Team-PIXEL/PIXELSum_zh_wiki_for_TA

Viewer • Updated Jan 21, 2024 • 2.56M • 62

Team-PIXEL/PIXELSum_hi_wiki_for_TA

Viewer • Updated Jan 21, 2024 • 450k • 67

Team-PIXEL/PIXELSum_en_wiki_for_TA

Viewer • Updated Jan 18, 2024 • 29.4M • 126

AI & ML interests

Recent Activity

Team members 13

Team-PIXEL's activity

PIXEL