pabRomero/BERT-full-finetuned-ner-pablo

@@ -21,11 +21,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google-bert/bert-base-uncased](https://huggingface.co/google-bert/bert-base-uncased) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7097
-- Precision: 0.0
-- Recall: 0.0
-- F1: 0.0
-- Accuracy: 0.8695
 ## Model description
@@ -44,10 +44,12 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0004
-- train_batch_size: 512
-- eval_batch_size: 512
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.05
@@ -56,13 +58,13 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1  | Accuracy |
-|:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:---:|:--------:|
-| No log        | 1.0   | 32   | 0.7111          | 0.0       | 0.0    | 0.0 | 0.8695   |
-| No log        | 2.0   | 64   | 0.7154          | 0.0       | 0.0    | 0.0 | 0.8695   |
-| No log        | 3.0   | 96   | 0.7107          | 0.0       | 0.0    | 0.0 | 0.8695   |
-| No log        | 4.0   | 128  | 0.7107          | 0.0       | 0.0    | 0.0 | 0.8695   |
-| No log        | 5.0   | 160  | 0.7097          | 0.0       | 0.0    | 0.0 | 0.8695   |
 ### Framework versions

 This model is a fine-tuned version of [google-bert/bert-base-uncased](https://huggingface.co/google-bert/bert-base-uncased) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1111
+- Precision: 0.8022
+- Recall: 0.7972
+- F1: 0.7997
+- Accuracy: 0.9747
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.0002
+- train_batch_size: 16
+- eval_batch_size: 16
 - seed: 42
+- gradient_accumulation_steps: 4
+- total_train_batch_size: 64
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.05
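
The `total_train_batch_size` of 64 reported above is consistent with the per-device batch size multiplied by the gradient accumulation steps. A minimal sketch of that arithmetic follows; the function name `effective_batch_size` and the `num_devices=1` default are assumptions for illustration, since the card does not state how many devices were used.

```python
def effective_batch_size(
    per_device_batch_size: int,
    gradient_accumulation_steps: int,
    num_devices: int = 1,  # assumption: single device, not stated in the card
) -> int:
    """Number of examples contributing to each optimizer update."""
    return per_device_batch_size * gradient_accumulation_steps * num_devices


# Values taken from the hyperparameter list above.
total = effective_batch_size(per_device_batch_size=16, gradient_accumulation_steps=4)
print(total)  # 64
```

Under this assumption, the optimizer steps once every 4 forward/backward passes of 16 examples each, so the update sees 64 examples while only 16 are held in memory at a time.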
 ### Training results
+| Training Loss | Epoch  | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
+|:-------------:|:------:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
+| No log        | 0.9970 | 252  | 0.0973          | 0.7590    | 0.7702 | 0.7646 | 0.9726   |
+| 0.164         | 1.9980 | 505  | 0.0867          | 0.7999    | 0.7776 | 0.7886 | 0.9751   |
+| 0.164         | 2.9990 | 758  | 0.0903          | 0.8044    | 0.7862 | 0.7952 | 0.9747   |
+| 0.0439        | 4.0    | 1011 | 0.0970          | 0.8032    | 0.7960 | 0.7996 | 0.9746   |
+| 0.0439        | 4.9852 | 1260 | 0.1111          | 0.8022    | 0.7972 | 0.7997 | 0.9747   |
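
The F1 column in the table above can be sanity-checked against the Precision and Recall columns, since F1 is their harmonic mean. The sketch below is an illustration only: the helper `f1_score` is a hypothetical name, not the metric implementation used in training, and it returns 0.0 when both inputs are zero, which matches the all-zero rows in the previous version of the card.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Final-epoch values from the table above.
final_f1 = f1_score(0.8022, 0.7972)
print(round(final_f1, 4))  # 0.7997

# The earlier run reported precision = recall = 0.0, which gives F1 = 0.0.
print(f1_score(0.0, 0.0))  # 0.0
```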
 ### Framework versions

runs/Aug23_13-54-39_ee1898c059d7/events.out.tfevents.1724421280.ee1898c059d7.1664.4

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9b03bbacae291621ec8215a080ba00fe31dfdb961d6dfd289f63668ad7997c43
-size 8053

 version https://git-lfs.github.com/spec/v1
+oid sha256:841eb39355f627a882744dbf9312bdc0a65695d5d80c9df1e01e0e3c772e8b04
+size 8879