Ahmed Masry PRO

ahmed-masry

https://ahmedmasryku.github.io/

Ahmed_Masry97

AI & ML interests

Multimodal Chart Understanding, Multimodal Document AI, Multimodal Vision-Language Models

Recent Activity

published a Space about 1 month ago

ahmed-masry/Label-Studio

updated a Space about 1 month ago

ahmed-masry/Label-Studio

updated a model 3 months ago

ahmed-masry/UI-TARS-2B-SFT

upvoted a paper 5 months ago

ChartQAPro: A More Diverse and Challenging Benchmark for Chart Question Answering

Paper • 2504.05506 • Published Apr 7 • 24

upvoted a paper 7 months ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 147

upvoted a paper 8 months ago

AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding

Paper • 2502.01341 • Published Feb 3 • 38

upvoted a paper 10 months ago

BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks

Paper • 2412.04626 • Published Dec 5, 2024 • 14

upvoted a paper about 1 year ago

ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild

Paper • 2407.04172 • Published Jul 4, 2024 • 26