Nano Llama of 1.7B parameters.

Built with BuildNanoGPT-Plus.

Trained on 25.8B tokens.

Evaluation

Hellaswag: 0.53

Validation loss of Cross Entropy: 2.46

License

This model is available under the Apache 2.0 License.

Discord Server

Join our Discord server here.

Feeling Generous? 😊

Eager to buy me a cup of 2$ coffee or iced tea?πŸ΅β˜• Sure, here is the link: https://ko-fi.com/drnicefellow. Please add a note on which one you want me to drink?

Downloads last month
-
Safetensors
Model size
2B params
Tensor type
F32
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support