dfurman
/

Llama-2-70B-Instruct-v0.1

Text Generation

Model card Files Files and versions Community

dfurman commited on Nov 18, 2023

Commit

2e43fdd

·

1 Parent(s): 7acb51a

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -128,7 +128,7 @@ Example 3:
 ## Model description
-The architecture is a modification of a standard decoder-only transformer.
 The llama-2-70b models have been modified from a standard transformer in the following ways:
 * It uses the [SwiGLU activation function](https://arxiv.org/abs/2002.05202)

 ## Model description
+The architecture is a modification of a standard decoder-only transformer and was trained as a causal language model (clm).
 The llama-2-70b models have been modified from a standard transformer in the following ways:
 * It uses the [SwiGLU activation function](https://arxiv.org/abs/2002.05202)