MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published Jan 14
Lizard: An Efficient Linearization Framework for Large Language Models Paper • 2507.09025 • Published 12 days ago