This repository is publicly accessible, but you have to accept the conditions to access its files and content.
Log in or Sign Up to review the conditions and access this model content.
Models trained using litgpt and AxoNN on AMD MI250 GPUs.
Train and validation data is taken from non-overlapping subsets of dolma.