merve's picture

merve PRO

merve

·

https://github.com/merveenoyan/smol-vision

AI & ML interests

I love this website VLMs, vision & co

Recent Activity

liked a Space about 19 hours ago

yonigozlan/Segment-Anything-2-video-tracking

liked a model 1 day ago

fancyfeast/llama-joycaption-beta-one-hf-llava

liked a model 1 day ago

BLIP3o/BLIP3o-NEXT-Pretrain-3B

View all activity

Organizations

Posts 153

Post

2568

GPT-4.1-mini level model right in your iPhone 🤯

openbmb/MiniCPM-V-4 is only 4B while surpassing GPT-4.1-mini in vision benchmarks 🔥

allows commercial use as well!

Articles 33

Article

44

Vision Language Model Alignment in TRL ⚡️

View all Articles

Collections 68

View 68 collections

spaces 107

Vision Papers

All paper summaries read by Merve

No application file

Test2

Llama Guard 4

Check if text and images are safe

Running on Zero

ShieldGemma2 VLM

Demo for ShieldGemma 2, multimodal safety model

UDOP

Generate text from document images

Running on Zero

Paligemma2 Vqav2

PaliGemma2 LoRA finetuned on VQAv2

View 107 Spaces

models 98

merve/Qwen2.5-VL-3B-Instruct-trl-mpo-rlaif-v

Updated 22 days ago

merve/smol-vision

Image-Text-to-Text • Updated 22 days ago • 95

merve/Qwen2.5-VL-7B-Instruct-trl-mpo-rlaif-v

Updated 23 days ago

merve/gemma-3n-finevideo

Updated 29 days ago • 7

merve/vjepa2-vitl-fpc16-256-ssv2-ucf101

Video Classification • 0.4B • Updated Jun 13 • 10

merve/test

merve/SmolVLM2-500M-Video-Instruct-video-feedback

Image-Text-to-Text • 0.5B • Updated Feb 20 • 6

merve/SmolVLM2-500M-Video-Instruct-videofeedback

Image-Text-to-Text • 0.5B • Updated Feb 20 • 4

merve/SmolVLM2-500M-Video-Instruct-emotions

Image-Text-to-Text • 0.5B • Updated Feb 20 • 5

merve/colpali_ufo

Updated Dec 20, 2024 • 6

datasets 30

merve/vlm_test_images

Viewer • Updated 10 days ago • 19 • 1.17k • 2

merve/finevideo-split

Viewer • Updated Jul 9 • 3.14k • 101

merve/test2

Updated Jun 20 • 5

merve/retail-in-the-wild

Viewer • Updated Mar 6 • 20 • 56 • 3

merve/model-test-inputs

Updated Oct 21, 2024 • 26

merve/vqav2-small

Viewer • Updated Aug 8, 2024 • 21.4k • 1.23k • 12

merve/SGinW

Viewer • Updated Jul 11, 2024 • 16.7k • 367 • 1

merve/pascal-voc

Viewer • Updated Jul 6, 2024 • 336k • 703 • 1

merve/YouCook2

Viewer • Updated May 28, 2024 • 2k • 63

merve/faiss_embeddings

Updated Jan 25, 2024 • 19

View 30 datasets