Nano Llama of 1.7B parameters.
Built with BuildNanoGPT-Plus.
Trained on 25.8B tokens.
Evaluation
Hellaswag: 0.53
Validation loss of Cross Entropy: 2.46
License
This model is available under the Apache 2.0 License.
Discord Server
Join our Discord server here.
Feeling Generous? π
Eager to buy me a cup of 2$ coffee or iced tea?π΅β Sure, here is the link: https://ko-fi.com/drnicefellow. Please add a note on which one you want me to drink?
- Downloads last month
- -
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
π
Ask for provider support