Faster Assisted Generation with Dynamic Speculation Article • By jmamou and 6 others • Oct 8, 2024
Distributed Speculative Inference of Large Language Models Paper • 2405.14105 • Published May 23, 2024