Seq vs Seq: An Open Suite of Paired Encoders and Decoders
Paper
•
2507.11412
•
Published
•
25
A collection of SOTA, open-data, paired encoder-only and decoder only models ranging from 17M params to 1B. See the paper at https://arxiv.org/abs/250
Note Encoder-only models
Note Decoder-only models
Note Data for pre-training and ordering per checkpoint
Note Encoders from Decoders
Note Decoders from Encoders