jhu-clsp 's Collections

Encoders vs Decoders: the Ettin Suite

A collection of SOTA, open-data, paired encoder-only and decoder only models ranging from 17M params to 1B. See the paper at https://arxiv.org/abs/250