Jorsini committed on
Commit
4b1d174
·
1 Parent(s): d24341e

End of training

Browse files
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [distilroberta-base](https://huggingface.co/distilroberta-base) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 1.5158
+ - Loss: 1.2664
 
  ## Model description
 
@@ -35,22 +35,37 @@ More information needed
 
  The following hyperparameters were used during training:
  - learning_rate: 0.0001
- - train_batch_size: 8
- - eval_batch_size: 8
+ - train_batch_size: 32
+ - eval_batch_size: 32
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 5
+ - num_epochs: 20
 
  ### Training results
 
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
- | 1.416 | 1.0 | 591 | 1.7960 |
- | 1.8915 | 2.0 | 1182 | 1.7130 |
- | 1.7498 | 3.0 | 1773 | 1.6111 |
- | 1.6478 | 4.0 | 2364 | 1.5306 |
- | 1.535 | 5.0 | 2955 | 1.5158 |
+ | No log | 1.0 | 148 | 1.6991 |
+ | No log | 2.0 | 296 | 1.6279 |
+ | No log | 3.0 | 444 | 1.5855 |
+ | 1.7179 | 4.0 | 592 | 1.5846 |
+ | 1.7179 | 5.0 | 740 | 1.5261 |
+ | 1.7179 | 6.0 | 888 | 1.4618 |
+ | 1.5059 | 7.0 | 1036 | 1.4146 |
+ | 1.5059 | 8.0 | 1184 | 1.4289 |
+ | 1.5059 | 9.0 | 1332 | 1.4022 |
+ | 1.5059 | 10.0 | 1480 | 1.3688 |
+ | 1.3326 | 11.0 | 1628 | 1.3335 |
+ | 1.3326 | 12.0 | 1776 | 1.3669 |
+ | 1.3326 | 13.0 | 1924 | 1.2971 |
+ | 1.1997 | 14.0 | 2072 | 1.3146 |
+ | 1.1997 | 15.0 | 2220 | 1.3336 |
+ | 1.1997 | 16.0 | 2368 | 1.2793 |
+ | 1.1149 | 17.0 | 2516 | 1.2469 |
+ | 1.1149 | 18.0 | 2664 | 1.2395 |
+ | 1.1149 | 19.0 | 2812 | 1.2302 |
+ | 1.1149 | 20.0 | 2960 | 1.2664 |
 
 
  ### Framework versions
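Note on the updated results: the reported `Loss: 1.2664` is the validation loss after the final epoch, not the lowest value in the table. The step counts are also consistent with both runs using a training set of the same size. The sketch below checks both points using only numbers copied from the table. The helper `train_size_bounds` is hypothetical, and it assumes one optimizer step per batch on a single device, no gradient accumulation, and a kept final partial batch; none of these is stated in the commit.

```python
# Validation loss per epoch, copied from the updated results table (epochs 1..20).
val_loss = [
    1.6991, 1.6279, 1.5855, 1.5846, 1.5261, 1.4618, 1.4146, 1.4289, 1.4022, 1.3688,
    1.3335, 1.3669, 1.2971, 1.3146, 1.3336, 1.2793, 1.2469, 1.2395, 1.2302, 1.2664,
]

# Epoch (1-based) with the lowest validation loss, and the loss at the last epoch.
best_epoch = min(range(len(val_loss)), key=val_loss.__getitem__) + 1
best_loss = val_loss[best_epoch - 1]
final_loss = val_loss[-1]


def train_size_bounds(steps_per_epoch, batch_size):
    """Smallest and largest dataset size that yields this many steps per epoch.

    Assumption (not confirmed by the commit): one step per batch, single device,
    no gradient accumulation, last partial batch kept.
    """
    return ((steps_per_epoch - 1) * batch_size + 1, steps_per_epoch * batch_size)


# Steps per epoch are the Step column divided by the Epoch column.
old_run = train_size_bounds(2955 // 5, 8)     # previous run: 591 steps per epoch
new_run = train_size_bounds(2960 // 20, 32)   # this run: 148 steps per epoch

# Dataset sizes compatible with both runs.
overlap = (max(old_run[0], new_run[0]), min(old_run[1], new_run[1]))

print(f"best epoch: {best_epoch} (loss {best_loss}), final loss: {final_loss}")
print(f"training set size compatible with both runs: {overlap[0]}..{overlap[1]}")
```

Under those assumptions, the lowest validation loss is at epoch 19, so a checkpoint selected on validation loss could differ from the one committed here.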
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:1089506d8801683d3e4bf95d9758311041fd6ba7a5154af0423542c2d8f41996
+ oid sha256:a534cf8710d4991bbdfe568d92b227a1259180b9638e3c463495396196c2c381
  size 328693404
runs/Jan07_21-16-25_402eeb12edff/events.out.tfevents.1704662198.402eeb12edff.295.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:62bed5fb62926aba90919255d52ecebbabbf6784d1172f56c25d0408b64f6781
+ size 4288
runs/Jan07_21-17-42_402eeb12edff/events.out.tfevents.1704662271.402eeb12edff.295.1 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:8612e8b8a67c14bcb22c8b3793ae593ce1f807191301c57aec1b6ff70af49b37
+ size 4290
runs/Jan07_21-18-08_402eeb12edff/events.out.tfevents.1704662290.402eeb12edff.295.2 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:00a6298d011488ea746ecb17daab216e372bf3d3e266804448eef62d643120e5
+ size 10849
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:97a896ab1bb651d4b0f13de8e6434a282624f25d247b2302e43dd350a952a223
+ oid sha256:bda99cfd1f618db57ab375072c5590e7b8de765040a3967e8b318e56beb0ee1f
  size 4600
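
Note on the binary files: the diffs for `model.safetensors`, `training_args.bin` and the `events.out.tfevents.*` files show Git LFS pointer files, not the file contents. Each pointer records the SHA-256 digest (`oid`) and byte size of the real object, which is why the size can stay the same while the `oid` changes. A minimal sketch of parsing such a pointer and checking a downloaded file against it follows. The function names `parse_lfs_pointer` and `matches_pointer` are hypothetical helpers for illustration, not part of any Git LFS or Hugging Face API.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Each line is "<key> <value>", separated by the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form "<hash-algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data, pointer):
    """Return True if the bytes have the size and SHA-256 digest named by the pointer."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


# The new training_args.bin pointer from this commit.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:bda99cfd1f618db57ab375072c5590e7b8de765040a3967e8b318e56beb0ee1f\n"
    "size 4600\n"
)
print(pointer["algo"], pointer["size"])
```

To check a real download, read the file in binary mode and pass its bytes to `matches_pointer` together with the parsed pointer. For a file as large as `model.safetensors`, hashing in chunks would be preferable to reading it all at once.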