Gemstone-256x23 / README.md
smcleish's picture
Upload GemmaForCausalLM
9158c8c verified
|
raw
history blame
416 Bytes
metadata
datasets:
  - allenai/dolma
language:
  - en
library_name: transformers
license: apache-2.0
tags:
  - causal-lm

Model Details

Training

Models trained using litgpt and AxoNN on AMD MI250 GPUs.

Data

Train and validation data is taken from non-overlapping subsets of dolma.