Update README.md
Browse files
README.md
CHANGED
@@ -5,4 +5,6 @@ datasets:
|
|
5 |
language:
|
6 |
- en
|
7 |
---
|
8 |
-
Model of the paper [MoM: Linear Sequence Modeling with Mixture-of-Memories](https://arxiv.org/abs/2502.13685).
|
|
|
|
|
|
5 |
language:
|
6 |
- en
|
7 |
---
|
8 |
+
Model of the paper [MoM: Linear Sequence Modeling with Mixture-of-Memories](https://arxiv.org/abs/2502.13685).
|
9 |
+
|
10 |
+
The model was trained on a sample of SlimPajama with 100B tokens. We use Gated-Deltanet as the memory update mechanism.
|