Article From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate By muellerzr and 3 others • Jun 13, 2024 • 55
Dynamic Pyramid Network for Efficient Multimodal Large Language Model Paper • 2503.20322 • Published Mar 26
Running • 110 • Open VLM Video Leaderboard 🌎 VLMEvalKit Eval Results in video understanding benchmark
InstantIR: Blind Image Restoration with Instant Generative Reference Paper • 2410.06551 • Published Oct 9, 2024 • 6
CSGO: Content-Style Composition in Text-to-Image Generation Paper • 2408.16766 • Published Aug 29, 2024 • 18