scbtm commited on
Commit
e657154
·
verified ·
1 Parent(s): 99fee42

End of training

Browse files
README.md CHANGED
@@ -19,9 +19,9 @@ should probably proofread and complete it, then remove this comment. -->
19
 
20
  This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on the None dataset.
21
  It achieves the following results on the evaluation set:
22
- - Loss: 0.6800
23
- - Accuracy: 0.6953
24
- - F1: 0.6945
25
 
26
  ## Model description
27
 
@@ -40,9 +40,9 @@ More information needed
40
  ### Training hyperparameters
41
 
42
  The following hyperparameters were used during training:
43
- - learning_rate: 2e-05
44
- - train_batch_size: 32
45
- - eval_batch_size: 32
46
  - seed: 42
47
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.98) and epsilon=1e-06 and optimizer_args=No additional optimizer arguments
48
  - lr_scheduler_type: linear
@@ -53,27 +53,22 @@ The following hyperparameters were used during training:
53
 
54
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 |
55
  |:-------------:|:------:|:----:|:---------------:|:--------:|:------:|
56
- | 1.2688 | 0.0590 | 250 | 1.1315 | 0.4781 | 0.4463 |
57
- | 1.0574 | 0.1181 | 500 | 0.9664 | 0.5575 | 0.5412 |
58
- | 0.9229 | 0.1771 | 750 | 0.8647 | 0.6070 | 0.6007 |
59
- | 0.8654 | 0.2361 | 1000 | 0.8665 | 0.6089 | 0.5922 |
60
- | 0.8229 | 0.2952 | 1250 | 0.7857 | 0.6448 | 0.6448 |
61
- | 0.8054 | 0.3542 | 1500 | 0.8515 | 0.6218 | 0.5993 |
62
- | 0.786 | 0.4132 | 1750 | 0.7533 | 0.6601 | 0.6552 |
63
- | 0.781 | 0.4723 | 2000 | 0.8133 | 0.6305 | 0.6278 |
64
- | 0.7563 | 0.5313 | 2250 | 0.7770 | 0.6480 | 0.6473 |
65
- | 0.7638 | 0.5903 | 2500 | 0.7248 | 0.6767 | 0.6769 |
66
- | 0.7384 | 0.6494 | 2750 | 0.7520 | 0.6597 | 0.6574 |
67
- | 0.7405 | 0.7084 | 3000 | 0.7615 | 0.6545 | 0.6515 |
68
- | 0.7222 | 0.7674 | 3250 | 0.7191 | 0.6790 | 0.6716 |
69
- | 0.7184 | 0.8264 | 3500 | 0.7037 | 0.6862 | 0.6837 |
70
- | 0.6984 | 0.8855 | 3750 | 0.7264 | 0.6716 | 0.6678 |
71
- | 0.6995 | 0.9445 | 4000 | 0.7455 | 0.6663 | 0.6646 |
72
- | 0.713 | 1.0035 | 4250 | 0.7294 | 0.6752 | 0.6701 |
73
- | 0.6508 | 1.0626 | 4500 | 0.6938 | 0.6872 | 0.6871 |
74
- | 0.642 | 1.1216 | 4750 | 0.7266 | 0.6716 | 0.6691 |
75
- | 0.635 | 1.1806 | 5000 | 0.6868 | 0.6913 | 0.6900 |
76
- | 0.6278 | 1.2397 | 5250 | 0.6800 | 0.6953 | 0.6945 |
77
 
78
 
79
  ### Framework versions
 
19
 
20
  This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on the None dataset.
21
  It achieves the following results on the evaluation set:
22
+ - Loss: 0.8255
23
+ - Accuracy: 0.6865
24
+ - F1: 0.6873
25
 
26
  ## Model description
27
 
 
40
  ### Training hyperparameters
41
 
42
  The following hyperparameters were used during training:
43
+ - learning_rate: 8e-05
44
+ - train_batch_size: 64
45
+ - eval_batch_size: 64
46
  - seed: 42
47
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.98) and epsilon=1e-06 and optimizer_args=No additional optimizer arguments
48
  - lr_scheduler_type: linear
 
53
 
54
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 |
55
  |:-------------:|:------:|:----:|:---------------:|:--------:|:------:|
56
+ | 1.0765 | 0.1653 | 350 | 0.8973 | 0.5849 | 0.5797 |
57
+ | 0.848 | 0.3305 | 700 | 0.7721 | 0.6516 | 0.6483 |
58
+ | 0.7796 | 0.4958 | 1050 | 0.7682 | 0.6466 | 0.6470 |
59
+ | 0.7671 | 0.6610 | 1400 | 0.7448 | 0.6611 | 0.6566 |
60
+ | 0.7434 | 0.8263 | 1750 | 0.7378 | 0.6643 | 0.6634 |
61
+ | 0.7232 | 0.9915 | 2100 | 0.7086 | 0.6789 | 0.6736 |
62
+ | 0.653 | 1.1568 | 2450 | 0.7150 | 0.6768 | 0.6764 |
63
+ | 0.6312 | 1.3220 | 2800 | 0.7119 | 0.6785 | 0.6761 |
64
+ | 0.6298 | 1.4873 | 3150 | 0.6982 | 0.6879 | 0.6843 |
65
+ | 0.6307 | 1.6525 | 3500 | 0.7072 | 0.6863 | 0.6864 |
66
+ | 0.6338 | 1.8178 | 3850 | 0.6950 | 0.6862 | 0.6813 |
67
+ | 0.6252 | 1.9830 | 4200 | 0.6996 | 0.6850 | 0.6853 |
68
+ | 0.4418 | 2.1483 | 4550 | 0.8353 | 0.6911 | 0.6899 |
69
+ | 0.4016 | 2.3135 | 4900 | 0.8428 | 0.6825 | 0.6815 |
70
+ | 0.404 | 2.4788 | 5250 | 0.8241 | 0.6824 | 0.6822 |
71
+ | 0.404 | 2.6440 | 5600 | 0.8255 | 0.6865 | 0.6873 |
 
 
 
 
 
72
 
73
 
74
  ### Framework versions
runs/Jan21_15-19-05_f3acabd62f42/events.out.tfevents.1737476068.f3acabd62f42.35586.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6292a2989249493e834794982034de07429ca1dd0e62b9f786284e407db850cb
3
+ size 457