Building on HF

merve PRO

merve

huggingface

https://github.com/merveenoyan/smol-vision

AI & ML interests

I love this website VLMs, vision & co

Recent Activity

updated a model 1 day ago

merve/gemma4-multiimage-thinking-lora

published a model 1 day ago

merve/gemma4-multiimage-thinking-lora

liked a model 1 day ago

nvidia/nemotron-ocr-v2

Organizations

published an article 7 days ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

+5

7 days ago

•

786

published an article 13 days ago

Article

Liberate your OpenClaw

+6

13 days ago

•

40

published an article about 1 month ago

Article

Mixture of Experts (MoEs) in Transformers

+5

Feb 26

•

149

published an article 2 months ago

Article

Community Evals: Because we're done trusting black-box leaderboards over the community

+5

Feb 4

•

88

published an article 2 months ago

Article

Introducing Daggr: Chain apps programmatically, inspect visually

+3

Jan 29

•

106

published an article 2 months ago

Article

We Got Claude to Build CUDA Kernels and teach open models!

+2

Jan 28

•

152

published an article 3 months ago

Article

Open Responses: What you need to know

+2

Jan 15

•

111

published an article 4 months ago

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

+4

Dec 18, 2025

•

124

published an article 5 months ago

Article

Streaming datasets: 100x More Efficient

+3

Oct 27, 2025

•

85

published an article 6 months ago

Article

Supercharge your OCR Pipelines with Open Models

+5

Oct 21, 2025

•

308

published an article 7 months ago

Article

Smol2Operator: Post-Training GUI Agents for Computer Use

+3

Sep 23, 2025

•

137

published an article 8 months ago

Article

Vision Language Model Alignment in TRL ⚡️

+3

Aug 7, 2025

•

109

published an article 9 months ago

Article

Introducing ColQwen-Omni: Retrieve in every modality

Jul 17, 2025

•

76

published an article 10 months ago

Article

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware

+3

Jun 19, 2025

•

101

published an article 10 months ago

Article

Learn the Hugging Face Kernel Hub in 5 Minutes

+5

Jun 12, 2025

•

163

published an article 10 months ago

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

+7

Jun 3, 2025

•

343

published an article 11 months ago

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

+5

May 21, 2025

•

254

published an article 11 months ago

Article

Vision Language Models (Better, faster, stronger)

+3

May 12, 2025

•

605

published an article 12 months ago

Article

Welcoming Llama Guard 4 on Hugging Face Hub

+2

Apr 29, 2025

•

41