DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting Paper • 2503.00784 • Published 11 days ago • 10 • 2
Full Parameter Fine-tuning for Large Language Models with Limited Resources Paper • 2306.09782 • Published Jun 16, 2023 • 30 • 4
Full Parameter Fine-tuning for Large Language Models with Limited Resources Paper • 2306.09782 • Published Jun 16, 2023 • 30 • 4