HugoVoxx's picture
Upload 12 files
8758510 verified
raw
history blame
82 Bytes
# Use 13 layers, for comparison against recurrent transformers.
NUM_LAYERS = 13