Learning Dynamics in Continual Pre-Training for Large Language Models. Paper • 2505.07796 • Published May 12