Shot categorizer Collection Fine-tune of Florence-2 to generate shot categories, useful for data curation. Code: https://github.com/huggingface/movie-shot-categorizer. • 3 items • Updated 19 days ago • 2
video-effects datasets Collection Smol datasets to emulate cool video effects like "squish", "dissolve", etc. Inspired by Pika effects. • 4 items • Updated Jan 28 • 3
Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget Paper • 2407.15811 • Published Jul 22, 2024 • 2
AI Tools for Art - Feb '25 Collection Tools & models from the 2nd issue of AI Tools for Art 🎉 Read more about February's releases: https://open.substack.com/pub/multimodalaiart • 18 items • Updated 24 days ago • 1
Remote VAE Inference Endpoints Collection Models and handler code used in https://huggingface.co/blog/remote_vae • 5 items • Updated 14 days ago • 4
AnyText2: Visual Text Generation and Editing With Customizable Attributes Paper • 2411.15245 • Published Nov 22, 2024 • 1
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published Feb 20 • 133
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Paper • 2501.09732 • Published Jan 16 • 70
video-effects Collection Fine-tunes of open video generation models like CogVideoX to emulate cool video effects like "squish", "dissolve", "cakeify", etc. Pika inspired. • 8 items • Updated 16 days ago • 6