Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
timaeus 's Collections
Datasets: Pile Subsets
Projects: Finetuning
Project: Lang2
Project: Lang1
Project: ICL1
Models: dh
Models: H-dh
Models: H
Models: L
Models: dm
Datasets: Suffixes
Datasets: Prefixes
Datasets: Delimiters
Datasets: Currencies
Datasets: Other

Models: H

updated Oct 18, 2024

Attention-only transformers, sweep over number of heads (for fixed head dimension)

Upvote
-

  • timaeus/H1-dh32

    0.0B • Updated Oct 18, 2024 • 4

  • timaeus/H2-dh32

    0.0B • Updated Oct 17, 2024 • 3

  • timaeus/H4-dh32

    0.0B • Updated Oct 17, 2024 • 4

  • timaeus/L2

    0.0B • Updated Oct 18, 2024 • 4

  • timaeus/H16-dh32

    0.0B • Updated Oct 17, 2024 • 3

  • timaeus/H32-dh32

    0.0B • Updated Oct 17, 2024 • 4

  • timaeus/H64-dh32

    0.0B • Updated Oct 17, 2024 • 6
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs