view article Article SigLIP 2: A better multilingual vision language encoder By ariG23498 and 2 others • Feb 21 • 171
💫StarVector Models Collection StarVector is a multimodal LLM for Scalable Vector Graphics (SVG) generation, producing structured SVG code directly from images and text. • 2 items • Updated Mar 20 • 96
view reply https://github.com/huggingface/smollm/blob/main/finetuning/Smol_VLM_FT.ipynbis not working, https://github.com/huggingface/smollm/blob/main/vision/finetuning/Smol_VLM_FT.ipynbseems to be the correct one.
view article Article SmolVLM Grows Smaller – Introducing the 250M & 500M Models! By andito and 2 others • Jan 23 • 181
view article Article SmolVLM - small yet mighty Vision Language Model By andito and 4 others • Nov 26, 2024 • 326
Running on Zero 90 90 catvton-flux 🖥 Generate virtual try-on images by masking and overlaying garments