Model Architecture Details

#24
by nbaligar - opened

Where can I find more architectural details (QKV size, vocabulary size etc) for this model?

Sign up or log in to comment