-
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU
Paper • 2502.08910 • Published • 149 -
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling
Paper • 2502.06703 • Published • 155 -
The Curse of Depth in Large Language Models
Paper • 2502.05795 • Published • 40
Shenxin Li
Adinosaur
·
AI & ML interests
None yet
Recent Activity
updated
a model
about 2 months ago
Adinosaur/tools
published
a model
about 2 months ago
Adinosaur/tools
updated
a model
about 2 months ago
Adinosaur/AAAI