smerchi committed · Commit 663df54 · verified · 1 Parent(s): 63a61ee

End of training

Files changed (1)
  1. README.md +13 -15
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [SI2M-Lab/DarijaBERT](https://huggingface.co/SI2M-Lab/DarijaBERT) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0247
+- Loss: 0.0256
 
 ## Model description
 
@@ -36,28 +36,26 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
 - train_batch_size: 16
-- eval_batch_size: 16
+- eval_batch_size: 8
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 12
+- num_epochs: 10
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log | 1.0 | 141 | 0.0312 |
-| No log | 2.0 | 282 | 0.0781 |
-| No log | 3.0 | 423 | 0.0229 |
-| 0.1308 | 4.0 | 564 | 0.0187 |
-| 0.1308 | 5.0 | 705 | 0.0208 |
-| 0.1308 | 6.0 | 846 | 0.0261 |
-| 0.1308 | 7.0 | 987 | 0.0242 |
-| 0.0008 | 8.0 | 1128 | 0.0244 |
-| 0.0008 | 9.0 | 1269 | 0.0247 |
-| 0.0008 | 10.0 | 1410 | 0.0247 |
-| 0.0005 | 11.0 | 1551 | 0.0247 |
-| 0.0005 | 12.0 | 1692 | 0.0247 |
+| No log | 1.0 | 132 | 0.0484 |
+| No log | 2.0 | 264 | 0.0225 |
+| No log | 3.0 | 396 | 0.0269 |
+| 0.1331 | 4.0 | 528 | 0.0217 |
+| 0.1331 | 5.0 | 660 | 0.0240 |
+| 0.1331 | 6.0 | 792 | 0.0248 |
+| 0.1331 | 7.0 | 924 | 0.0250 |
+| 0.0014 | 8.0 | 1056 | 0.0253 |
+| 0.0014 | 9.0 | 1188 | 0.0256 |
+| 0.0014 | 10.0 | 1320 | 0.0256 |
 
 
 ### Framework versions
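
Review note: the added training-results rows can be read mechanically. The sketch below is not part of the commit; it copies the new run's rows from the diff and reports the epoch with the lowest validation loss, the steps per epoch, and the range of training-set sizes consistent with that step count. The size estimate rests on assumptions the model card does not state: a single device, no gradient accumulation, and a data loader that keeps the last partial batch. The helper names (`best_epoch`, `steps_per_epoch`, `implied_train_size`) are hypothetical and chosen for illustration only.

```python
# (epoch, step, validation_loss) for the new run, copied from the added rows.
new_run = [
    (1, 132, 0.0484),
    (2, 264, 0.0225),
    (3, 396, 0.0269),
    (4, 528, 0.0217),
    (5, 660, 0.0240),
    (6, 792, 0.0248),
    (7, 924, 0.0250),
    (8, 1056, 0.0253),
    (9, 1188, 0.0256),
    (10, 1320, 0.0256),
]

TRAIN_BATCH_SIZE = 16  # train_batch_size from the hyperparameter list


def best_epoch(rows):
    """Return the (epoch, step, val_loss) row with the lowest validation loss."""
    return min(rows, key=lambda row: row[2])


def steps_per_epoch(rows):
    """Return the steps per epoch, checking that it is constant across rows."""
    per_epoch = rows[0][1] // rows[0][0]
    for epoch, step, _ in rows:
        if step != epoch * per_epoch:
            raise ValueError(f"step count is not constant at epoch {epoch}")
    return per_epoch


def implied_train_size(steps, batch_size):
    """Return (min, max) training-set sizes that yield `steps` batches per epoch.

    Assumption (not stated in the model card): one device, no gradient
    accumulation, and the last partial batch is kept rather than dropped.
    """
    return (steps - 1) * batch_size + 1, steps * batch_size


if __name__ == "__main__":
    epoch, step, loss = best_epoch(new_run)
    final_loss = new_run[-1][2]
    print(f"lowest validation loss: {loss} at epoch {epoch} (step {step})")
    print(f"final validation loss:  {final_loss}")

    per_epoch = steps_per_epoch(new_run)
    low, high = implied_train_size(per_epoch, TRAIN_BATCH_SIZE)
    print(f"steps per epoch: {per_epoch}")
    print(f"training-set size under the stated assumptions: {low} to {high}")
```

The reported evaluation loss (0.0256) is the final-epoch value, while the table's minimum occurs earlier in the run, so anyone reusing this checkpoint may want to check which epoch's weights were actually saved.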