view article Article Zero-shot image segmentation with CLIPSeg By tobiasc and 1 other • Dec 21, 2022 • 9
view article Article The Falcon has landed in the Hugging Face ecosystem By lvwerra and 7 others • Jun 5, 2023 • 14
view article Article makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch By AviSoori1x • May 7, 2024 • 84
OS-Copilot: Towards Generalist Computer Agents with Self-Improvement Paper • 2402.07456 • Published Feb 12, 2024 • 46
view article Article Zero-shot image-to-text generation with BLIP-2 By MariaK and 1 other • Feb 15, 2023 • 21
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community By Leyo and 2 others • Apr 15, 2024 • 181
view article Article Fine tuning CLIP with Remote Sensing (Satellite) images and captions By arampacha and 5 others • Oct 13, 2021 • 7
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models Paper • 2402.17177 • Published Feb 27, 2024 • 89