DB-LLM: Accurate Dual-Binarization for Efficient LLMs Paper • 2402.11960 • Published Feb 19, 2024 • 2
How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study Paper • 2404.14047 • Published Apr 22, 2024 • 46