Energy-Based Transformers are Scalable Learners and Thinkers Paper • 2507.02092 • Published 7 days ago
DarwinLM: Evolutionary Structured Pruning of Large Language Models Paper • 2502.07780 • Published Feb 11