Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models Paper • 2504.04823 • Published 9 days ago • 28
Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models Paper • 2504.04823 • Published 9 days ago • 28
Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models Paper • 2504.04823 • Published 9 days ago • 28 • 2
FlatQuant: Flatness Matters for LLM Quantization Paper • 2410.09426 • Published Oct 12, 2024 • 15 • 2
IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact Paper • 2403.01241 • Published Mar 2, 2024 • 1