Prepare version of SmolLM2 models with MLA (Multihead latent attention)

#9
by verion1 - opened
Your need to confirm your account before you can post a new comment.

Sign up or log in to comment