Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Adinosaur 's Collections
LLM structure optimization
LLM Evaluation
LLM Dataset
Deepseek R1
Image_Generation

LLM structure optimization

updated Apr 12
Upvote
-

  • InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

    Paper • 2502.08910 • Published Feb 13 • 149

  • Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

    Paper • 2502.06703 • Published Feb 10 • 155

  • The Curse of Depth in Large Language Models

    Paper • 2502.05795 • Published Feb 9 • 40
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs