End of training

Browse files

Files changed (6) hide show

README.md +7 -37
config.json +1 -1
model.safetensors +1 -1
runs/Jan07_19-48-20_2c2be611b99b/events.out.tfevents.1704657001.2c2be611b99b.440.0 +3 -0
runs/Jan07_19-50-54_2c2be611b99b/events.out.tfevents.1704657065.2c2be611b99b.440.1 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilroberta-base](https://huggingface.co/distilroberta-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.3382
 ## Model description
@@ -40,47 +40,17 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 35
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 265  | 2.3628          |
-| 2.904         | 2.0   | 530  | 2.0845          |
-| 2.904         | 3.0   | 795  | 2.0250          |
-| 2.1137        | 4.0   | 1060 | 1.9491          |
-| 2.1137        | 5.0   | 1325 | 1.8606          |
-| 1.8476        | 6.0   | 1590 | 1.7733          |
-| 1.8476        | 7.0   | 1855 | 1.7121          |
-| 1.6693        | 8.0   | 2120 | 1.6887          |
-| 1.6693        | 9.0   | 2385 | 1.7146          |
-| 1.5233        | 10.0  | 2650 | 1.6577          |
-| 1.5233        | 11.0  | 2915 | 1.6567          |
-| 1.3942        | 12.0  | 3180 | 1.5722          |
-| 1.3942        | 13.0  | 3445 | 1.6422          |
-| 1.2933        | 14.0  | 3710 | 1.5093          |
-| 1.2933        | 15.0  | 3975 | 1.5304          |
-| 1.1985        | 16.0  | 4240 | 1.5071          |
-| 1.1058        | 17.0  | 4505 | 1.4910          |
-| 1.1058        | 18.0  | 4770 | 1.4515          |
-| 1.0401        | 19.0  | 5035 | 1.4776          |
-| 1.0401        | 20.0  | 5300 | 1.4555          |
-| 0.9783        | 21.0  | 5565 | 1.4275          |
-| 0.9783        | 22.0  | 5830 | 1.4391          |
-| 0.9074        | 23.0  | 6095 | 1.4229          |
-| 0.9074        | 24.0  | 6360 | 1.3791          |
-| 0.8672        | 25.0  | 6625 | 1.3893          |
-| 0.8672        | 26.0  | 6890 | 1.3848          |
-| 0.8392        | 27.0  | 7155 | 1.3653          |
-| 0.8392        | 28.0  | 7420 | 1.3756          |
-| 0.7796        | 29.0  | 7685 | 1.3914          |
-| 0.7796        | 30.0  | 7950 | 1.3098          |
-| 0.74          | 31.0  | 8215 | 1.3438          |
-| 0.74          | 32.0  | 8480 | 1.3633          |
-| 0.7368        | 33.0  | 8745 | 1.3185          |
-| 0.7036        | 34.0  | 9010 | 1.3183          |
-| 0.7036        | 35.0  | 9275 | 1.3382          |
 ### Framework versions

 This model is a fine-tuned version of [distilroberta-base](https://huggingface.co/distilroberta-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.5158
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 1.416         | 1.0   | 591  | 1.7960          |
+| 1.8915        | 2.0   | 1182 | 1.7130          |
+| 1.7498        | 3.0   | 1773 | 1.6111          |
+| 1.6478        | 4.0   | 2364 | 1.5306          |
+| 1.535         | 5.0   | 2955 | 1.5158          |
 ### Framework versions

config.json CHANGED Viewed

@@ -1,7 +1,7 @@
 {
   "_name_or_path": "distilroberta-base",
   "architectures": [
-    "RobertaForCausalLM"
   ],
   "attention_probs_dropout_prob": 0.1,
   "bos_token_id": 0,

 {
   "_name_or_path": "distilroberta-base",
   "architectures": [
+    "RobertaForMaskedLM"
   ],
   "attention_probs_dropout_prob": 0.1,
   "bos_token_id": 0,

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:eef4fb17af7889ef8b807c426b680dee76e38e7266ef8bfb41a247a47a19c23a
 size 328693404

 version https://git-lfs.github.com/spec/v1
+oid sha256:1089506d8801683d3e4bf95d9758311041fd6ba7a5154af0423542c2d8f41996
 size 328693404

runs/Jan07_19-48-20_2c2be611b99b/events.out.tfevents.1704657001.2c2be611b99b.440.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:af9a211b283bfd460af3bb3f7a6332d904e0155a8ec93be24b25371b6a070e2b
+size 4288

runs/Jan07_19-50-54_2c2be611b99b/events.out.tfevents.1704657065.2c2be611b99b.440.1 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c73f8814afa6d21aba8007ada3a5707bac9729b08c7702f1e7577fa550a4fe45
+size 6781

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:806708569b197b39154a7fe1b0a89cdf9ff1e5c3a56cc2038b62c80273b604aa
 size 4600

 version https://git-lfs.github.com/spec/v1
+oid sha256:97a896ab1bb651d4b0f13de8e6434a282624f25d247b2302e43dd350a952a223
 size 4600