view post Post 1803 Reply florence-tool (https://github.com/bigdata-pw/florence-tool) now supports WebDataset! Check it out for efficient batch inference with Florence-2 models microsoft/Florence-2-large microsoft/Florence-2-baseCurrently running it myself on A40 with CAPTION task and a streaming WebDataset @ 60k images/hour!
view post Post 2152 Reply BIG update dropped for bigdata-pw/Flickr - now ~515M images! Target for the next update: 1BIn case you missed them; other recent drops include bigdata-pw/Dinosaurs - a small set of BIG creatures ๐ฆ๐ฆ and the first in a series of articles about the art of web scraping! https://huggingface.co/blog/hlky/web-scraping-101 https://huggingface.co/blog/hlky/web-scraping-102 Stay tuned for exciting datasets and models coming soon:- PC and Console game screenshots- TV/Film actors biographies and photos (think facial recognition and automatic captioning!)- bigdata-pw/lyrics-gpt v2- and more!