merve
AI & ML interests
I love this website
VLMs, vision & co
Recent Activity
Introducing ColQwen-Omni: Retrieve in every modality
Co-authored (lead author and 4 others)
published an article about 1 month ago
(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware
published an article about 1 month ago
Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub
Co-authored (lead author and 6 others)
published an article about 2 months ago
SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data
nanoVLM: The simplest repository to train your VLM in pure PyTorch
Vision Language Models (Better, Faster, Stronger)
Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM
SigLIP 2: A better multilingual vision language encoder
SmolVLM2: Bringing Video Understanding to Every Device
Open-source DeepResearch – Freeing our search agents
SmolVLM Grows Smaller – Introducing the 250M & 500M Models!
Introducing smolagents: simple agents that write actions in code.
Welcome PaliGemma 2 – New vision language models by Google
SmolVLM - small yet mighty Vision Language Model
Llama can now see and run on your device - welcome Llama 3.2
published an article about 1 year ago
Preference Optimization for Vision Language Models
published an article about 1 year ago
Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models