amenur's Collections

Attention

updated Mar 1, 2024

  • System 2 Attention (is something you might need too)

    Paper • 2311.11829 • Published Nov 20, 2023 • 43

  • Transformers are Multi-State RNNs

    Paper • 2401.06104 • Published Jan 11, 2024 • 39

  • The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

    Paper • 2402.17764 • Published Feb 27, 2024 • 620