Commit 0887071 · 1 parent: a2c6fee
Update README.md
README.md
CHANGED
@@ -6,4 +6,8 @@ language:
 metrics:
 - accuracy
 pipeline_tag: fill-mask
----
+---
+
+A small version of `DeBERTa` trained on the clean version of google C4 dataset. For more info about the size of the model, see `config.json`.
+
+The model has been trained for **100K** steps with a batch size of **2048** and a sequence length of **512**, for a total of **104** billion tokens.
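
Note on the token count in the added README text: the stated total can be roughly checked from the three training figures. The sketch below assumes that "100K" means exactly 100,000 steps and that every step consumes a full batch of full-length sequences (no padding or short final batch); neither assumption is confirmed by the commit. The variable names are illustrative only.

```python
# Rough check of the training-token total quoted in the README.
# Assumptions (not confirmed by the commit): exactly 100,000 steps,
# every sequence filled to 512 tokens, every batch full.
steps = 100_000
batch_size = 2048
seq_len = 512

tokens_per_step = batch_size * seq_len   # 1,048,576 tokens per step
total_tokens = steps * tokens_per_step   # 104,857,600,000 tokens

print(f"{total_tokens:,} tokens (~{total_tokens / 1e9:.1f} billion)")
```

Under these assumptions the product is about 104.9 billion tokens, so the README's "104 billion" appears to be that figure truncated rather than rounded.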