Shakker-Labs/FLUX.1-dev-LoRA-Children-Simple-Sketch Text-to-Image β’ Updated Sep 11, 2024 β’ 1.81k β’ β’ 112
view article Article SigLIP 2: A better multilingual vision language encoder By ariG23498 and 2 others β’ Feb 21 β’ 181
π«StarVector Models Collection StarVector is a multimodal LLM for Scalable Vector Graphics (SVG) generation, producing structured SVG code directly from images and text. β’ 2 items β’ Updated Mar 20 β’ 97
view reply https://github.com/huggingface/smollm/blob/main/finetuning/Smol_VLM_FT.ipynbis not working, https://github.com/huggingface/smollm/blob/main/vision/finetuning/Smol_VLM_FT.ipynbseems to be the correct one.
view article Article SmolVLM Grows Smaller β Introducing the 250M & 500M Models! By andito and 2 others β’ Jan 23 β’ 183
view article Article SmolVLM - small yet mighty Vision Language Model By andito and 4 others β’ Nov 26, 2024 β’ 358
Runtime error 92 92 catvton-flux π₯ Generate virtual try-on images by masking and overlaying garments
yanka9/vilt_finetuned_deepfashionVQA_v2 Visual Question Answering β’ 0.1B β’ Updated Feb 16 β’ 105 β’ 5