MaziyarPanahi/Llama-Nemotron-Post-Training-Dataset-v1-ShareGPT • Updated 11 days ago • 30.2M • 1.26k • 31
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published Jan 14 • 282