InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU • arXiv:2502.08910 • Published Feb 13, 2025
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling • arXiv:2502.06703 • Published Feb 10, 2025