ChakuChidiya
/

distilbert-base-uncased-G1

ChakuChidiya commited on Apr 24, 2024

Commit

e9ed67d

verified ·

1 Parent(s): 9ef64b5

Upload TFDistilBertForTokenClassification

Files changed (3) hide show

README.md CHANGED Viewed

@@ -15,9 +15,9 @@ probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 0.0896
-- Validation Loss: 0.3813
-- Epoch: 4
 ## Model description
@@ -36,18 +36,16 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- optimizer: {'name': 'AdamWeightDecay', 'learning_rate': {'module': 'keras.optimizers.schedules', 'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 4e-05, 'decay_steps': 3675, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}, 'registered_name': None}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False, 'weight_decay_rate': 0.07}
 - training_precision: float32
 ### Training results
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
-| 0.5186     | 0.3494          | 0     |
-| 0.2823     | 0.3332          | 1     |
-| 0.1904     | 0.3378          | 2     |
-| 0.1261     | 0.3577          | 3     |
-| 0.0896     | 0.3813          | 4     |
 ### Framework versions

 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Train Loss: 0.1913
+- Validation Loss: 0.3219
+- Epoch: 2
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- optimizer: {'name': 'AdamWeightDecay', 'learning_rate': {'module': 'keras.optimizers.schedules', 'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 4e-05, 'decay_steps': 2205, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}, 'registered_name': None}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False, 'weight_decay_rate': 0.07}
 - training_precision: float32
 ### Training results
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
+| 0.5249     | 0.3549          | 0     |
+| 0.2824     | 0.3316          | 1     |
+| 0.1913     | 0.3219          | 2     |
 ### Framework versions

config.json CHANGED Viewed

@@ -10,25 +10,25 @@
   "hidden_dim": 3072,
   "id2label": {
     "0": "O",
-    "1": "B-treatment",
-    "2": "I-treatment",
-    "3": "B-chronic_disease",
-    "4": "I-chronic_disease",
-    "5": "B-allergy_name",
-    "6": "I-allergy_name",
-    "7": "B-cancer",
-    "8": "I-cancer"
   },
   "initializer_range": 0.02,
   "label2id": {
-    "B-allergy_name": 5,
-    "B-cancer": 7,
-    "B-chronic_disease": 3,
-    "B-treatment": 1,
-    "I-allergy_name": 6,
-    "I-cancer": 8,
-    "I-chronic_disease": 4,
-    "I-treatment": 2,
     "O": 0
   },
   "max_position_embeddings": 512,

   "hidden_dim": 3072,
   "id2label": {
     "0": "O",
+    "1": "B-allergy_name",
+    "2": "I-allergy_name",
+    "3": "B-cancer",
+    "4": "I-cancer",
+    "5": "B-chronic_disease",
+    "6": "I-chronic_disease",
+    "7": "B-treatment",
+    "8": "I-treatment"
   },
   "initializer_range": 0.02,
   "label2id": {
+    "B-allergy_name": 1,
+    "B-cancer": 3,
+    "B-chronic_disease": 5,
+    "B-treatment": 7,
+    "I-allergy_name": 2,
+    "I-cancer": 4,
+    "I-chronic_disease": 6,
+    "I-treatment": 8,
     "O": 0
   },
   "max_position_embeddings": 512,

tf_model.h5 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a38650202faa13208e9a7a64c73203fb933f4c323e80be2f396c48701ce01d10
 size 265606416

 version https://git-lfs.github.com/spec/v1
+oid sha256:93346d02c5635935e1fb66b6b7c7256eb0cdd95fb1a0ab30bf404aa7cdc105bf
 size 265606416