Info

This model is derived from mistralai/Mistral-7B-Instruct-v0.2. I cut the intermediate size (feed_forward_length) of every layer from 14336 down to 3072, resulting in a ~2.81B-parameter model.
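As a rough sanity check on the ~2.81B figure, the parameter count can be recomputed from the architecture. This is a minimal sketch, not part of the original conversion: the function name `mistral_param_count` is hypothetical, and the hidden size, layer count, head counts, and vocabulary size are assumed from the published Mistral-7B-Instruct-v0.2 config rather than stated in this card.

```python
def mistral_param_count(
    intermediate_size,
    hidden_size=4096,    # assumed from the Mistral-7B config
    num_layers=32,       # assumed
    num_heads=32,        # assumed
    num_kv_heads=8,      # assumed (grouped-query attention)
    vocab_size=32000,    # assumed
):
    """Count parameters of a Mistral-style decoder with untied embeddings."""
    head_dim = hidden_size // num_heads
    kv_dim = num_kv_heads * head_dim

    # q_proj and o_proj are hidden x hidden; k_proj and v_proj are hidden x kv_dim.
    attention = 2 * hidden_size * hidden_size + 2 * hidden_size * kv_dim
    # gate_proj, up_proj, and down_proj each hold hidden x intermediate weights.
    mlp = 3 * hidden_size * intermediate_size
    # Two RMSNorm weight vectors per layer.
    norms = 2 * hidden_size

    per_layer = attention + mlp + norms
    # Input embedding plus a separate output head, then the final norm.
    embeddings = 2 * vocab_size * hidden_size
    final_norm = hidden_size
    return num_layers * per_layer + embeddings + final_norm


original = mistral_param_count(14336)
reduced = mistral_param_count(3072)
print(f"original: {original / 1e9:.2f}B, reduced: {reduced / 1e9:.2f}B")
```

Under these assumptions, only the three MLP projections shrink, while the attention weights and embeddings keep their original shapes. That is why the total falls to roughly 2.8B rather than scaling with the full 14336-to-3072 ratio.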

This model needs further pre-training, because at the moment it generates only gibberish.

Format: Safetensors. Model size: 2.81B params. Tensor type: BF16.