Update README.md
Browse files
README.md
CHANGED
@@ -6,4 +6,5 @@ language:
|
|
6 |
- en
|
7 |
---
|
8 |
Model of the paper [MoM: Linear Sequence Modeling with Mixture-of-Memories](https://arxiv.org/abs/2502.13685).
|
|
|
9 |
The model was trained on a sample of SlimPajama with 15B tokens. We use Gated-Deltanet as the memory update mechanism.
|
|
|
6 |
- en
|
7 |
---
|
8 |
Model of the paper [MoM: Linear Sequence Modeling with Mixture-of-Memories](https://arxiv.org/abs/2502.13685).
|
9 |
+
|
10 |
The model was trained on a sample of SlimPajama with 15B tokens. We use Gated-Deltanet as the memory update mechanism.
|