Scalable Chain of Thoughts via Elastic Reasoning Paper • 2505.05315 • Published 6 days ago • 22
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models Paper • 2309.14717 • Published Sep 26, 2023 • 44