Artificial Hippocampus Networks for Efficient Long-Context Modeling Paper • 2510.07318 • Published 8 days ago • 26
Muon Outperforms Adam in Tail-End Associative Memory Learning Paper • 2509.26030 • Published 17 days ago • 18
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain Paper • 2509.26507 • Published 17 days ago • 478
Textbooks Are All You Need II: phi-1.5 technical report Paper • 2309.05463 • Published Sep 11, 2023 • 88