Commit 110f60a · 1 Parent(s): 9790fc9
Update README.md
README.md CHANGED
@@ -79,10 +79,10 @@ The model was trained on the following dataset:
 - **Hardware:** 1 x Nvidia RTX 3050 8GB GPU
 - **Hours Trained:** 520 hours approximately.
 - **Optimizer:** AdamW
-- **Gradient Accumulations**:
+- **Gradient Accumulations**: 4
 - **Batch:** 1
 - **Learning rate:** warmup to 1e-7 for 10,000 steps and then kept constant
-- **Total Training Steps:** 1,
+- **Total Training Steps:** 1,489,983
 
 Developed by: [ZeroCool94](https://huggingface.co/ZeroCool94) at [Sygil-Dev](https://github.com/Sygil-Dev/)
 
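Note on the values filled in by this hunk: with a batch of 1 and 4 gradient accumulations on a single GPU, each optimizer update sees an effective batch of 4 samples. The learning-rate line can be read as the schedule sketched below. This is only an illustration under stated assumptions: the README does not say whether the warmup is linear or where it starts, so a linear ramp from zero is assumed here, and the function names `learning_rate` and `effective_batch_size` are hypothetical, not taken from the training code.

```python
def learning_rate(step, peak_lr=1e-7, warmup_steps=10_000):
    """Assumed schedule: linear warmup from 0 to peak_lr, then held constant."""
    if step < warmup_steps:
        # Assumption: the ramp is linear; the README only says "warmup".
        return peak_lr * step / warmup_steps
    return peak_lr


def effective_batch_size(batch=1, grad_accum=4, num_gpus=1):
    """Samples contributing to one optimizer update."""
    return batch * grad_accum * num_gpus


if __name__ == "__main__":
    # Sample the schedule at the start, mid-warmup, end of warmup, and final step.
    for step in (0, 5_000, 10_000, 1_489_983):
        print(step, learning_rate(step))
    print("effective batch size:", effective_batch_size())
```

Whether the 1,489,983 total steps count optimizer updates or individual accumulation micro-steps is not stated in the README, so the sketch makes no claim about it.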