Post
201
space 🌌 a new recursive language model architecture that outperforms bigger vanilla transformers on several fronts: size, perplexity and validation loss... sounds too good to be true right?
new blog and details coming soon
new blog and details coming soon