timaeus's Collections
Models: dh
Updated Oct 18, 2024
Attention-only transformers, sweep over head dimension
timaeus/H8-dh4 • 0.0B • Updated Oct 17, 2024 • 4
timaeus/H8-dh8 • 0.0B • Updated Oct 17, 2024 • 5
timaeus/H8-dh16 • 0.0B • Updated Oct 17, 2024 • 7
timaeus/L2 • 0.0B • Updated Oct 18, 2024 • 4
timaeus/H8-dh64 • 0.0B • Updated Oct 17, 2024 • 14
timaeus/H8-dh128 • 0.0B • Updated Oct 17, 2024 • 12
timaeus/H8-dh256 • 0.0B • Updated Oct 17, 2024 • 19