language: - en tags: - causal-lm library_name: transformers license: apache-2.0 datasets: - allenai/dolma
Models trained using litgpt and AxoNN on AMD MI250 GPUs.
Train and validation data is taken from non-overlapping subsets of dolma.