YuanZ77's Collections
Papers

updated Feb 20

  • Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling

    Paper • 2304.01373 • Published Apr 3, 2023 • 9

  • Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

    Paper • 2502.11089 • Published Feb 16, 2025 • 165