Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
timaeus 's Collections
Datasets: Pile Subsets
Projects: Finetuning
Project: Lang2
Project: Lang1
Project: ICL1
Models: dh
Models: H-dh
Models: H
Models: L
Models: dm
Datasets: Suffixes
Datasets: Prefixes
Datasets: Delimiters
Datasets: Currencies
Datasets: Other

Models: dh

updated Oct 18, 2024

Attention-only transformers, sweep over head dimension

Upvote
-

  • timaeus/H8-dh4

    0.0B • Updated Oct 17, 2024 • 4

  • timaeus/H8-dh8

    0.0B • Updated Oct 17, 2024 • 5

  • timaeus/H8-dh16

    0.0B • Updated Oct 17, 2024 • 7

  • timaeus/L2

    0.0B • Updated Oct 18, 2024 • 4

  • timaeus/H8-dh64

    0.0B • Updated Oct 17, 2024 • 14

  • timaeus/H8-dh128

    0.0B • Updated Oct 17, 2024 • 12

  • timaeus/H8-dh256

    0.0B • Updated Oct 17, 2024 • 19
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs