view article Article From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate By muellerzr and 3 others • Jun 13, 2024 • 55
view article Article Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model By VictorSanh and 10 others • Aug 22, 2023 • 35
view article Article Incredibly Fast BLOOM Inference with DeepSpeed and Accelerate By stas and 1 other • Sep 16, 2022 • 1
view article Article Fit More and Train Faster With ZeRO via DeepSpeed and FairScale By stas • Jan 19, 2021 • 4
view article Article Porting fairseq wmt19 translation system to transformers By stas • Nov 3, 2020 • 1