agi-css/distilroberta-base-mic

Text Classification

Generated from Trainer

agi-css committed on May 3, 2022

Commit 719f9c5 · 1 Parent(s): 87ef823

update model card README.md

Files changed (1)

README.md +12 -15

@@ -15,11 +15,11 @@ should probably proofread and complete it, then remove this comment. -->
 # distilroberta-base-mic
-This model is a fine-tuned version of [distilroberta-base](https://huggingface.co/distilroberta-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.6948
-- Accuracy: 0.7124
-- F1: 0.7122
 ## Model description
@@ -38,25 +38,22 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 5.8525604794432464e-05
-- train_batch_size: 400
-- eval_batch_size: 400
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 7
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|
-| No log        | 1.0   | 22   | 0.5543          | 0.7124   | 0.7056 |
-| No log        | 2.0   | 44   | 0.5304          | 0.7209   | 0.7191 |
-| No log        | 3.0   | 66   | 0.5412          | 0.7331   | 0.7314 |
-| No log        | 4.0   | 88   | 0.5614          | 0.7190   | 0.7175 |
-| No log        | 5.0   | 110  | 0.6271          | 0.7133   | 0.7120 |
-| No log        | 6.0   | 132  | 0.6746          | 0.7030   | 0.7024 |
-| No log        | 7.0   | 154  | 0.6948          | 0.7124   | 0.7122 |
 ### Framework versions

 # distilroberta-base-mic
+This model is a fine-tuned version of [distilroberta-base](https://huggingface.co/distilroberta-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3435
+- Accuracy: 0.9104
+- F1: 0.9103
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 8.748413056668156e-05
+- train_batch_size: 200
+- eval_batch_size: 200
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 4
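
As a rough illustration of the `lr_scheduler_type: linear` setting listed above, the sketch below computes the learning rate at a given optimizer step. It assumes no warmup and a decay to zero over 480 total steps (4 epochs at 120 steps per epoch, as in the updated results table). The function name `linear_lr` and the zero-warmup default are assumptions for illustration; the model card does not state a warmup value.

```python
def linear_lr(
    step: int,
    peak_lr: float = 8.748413056668156e-05,  # learning_rate from the updated card
    total_steps: int = 480,                  # 4 epochs x 120 steps (from the table)
    warmup_steps: int = 0,                   # assumption: not stated in the card
) -> float:
    """Learning rate under linear warmup followed by linear decay to zero."""
    if step < warmup_steps:
        # Ramp up linearly from 0 to peak_lr during warmup.
        return peak_lr * step / max(1, warmup_steps)
    # Decay linearly from peak_lr to 0 over the remaining steps.
    remaining = max(0, total_steps - step)
    return peak_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    # Print the learning rate at the end of each epoch.
    for epoch_end_step in (120, 240, 360, 480):
        print(epoch_end_step, linear_lr(epoch_end_step))
```

Under these assumptions the rate starts at the configured peak and reaches zero at step 480, so the later epochs train with progressively smaller updates.
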
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|
+| No log        | 1.0   | 120  | 0.2830          | 0.8804   | 0.8797 |
+| No log        | 2.0   | 240  | 0.2398          | 0.9046   | 0.9046 |
+| No log        | 3.0   | 360  | 0.3474          | 0.8959   | 0.8954 |
+| No log        | 4.0   | 480  | 0.3435          | 0.9104   | 0.9103 |
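
In the added rows above, validation loss is lowest at epoch 2 while accuracy and F1 are highest at epoch 4. The following minimal sketch copies those four rows and picks the best epoch under each criterion. The variable names are illustrative, and nothing here claims which checkpoint the author actually kept.

```python
# Rows copied from the updated training results table:
# (epoch, step, validation_loss, accuracy, f1)
rows = [
    (1, 120, 0.2830, 0.8804, 0.8797),
    (2, 240, 0.2398, 0.9046, 0.9046),
    (3, 360, 0.3474, 0.8959, 0.8954),
    (4, 480, 0.3435, 0.9104, 0.9103),
]

# Epoch with the lowest validation loss.
best_by_loss = min(rows, key=lambda r: r[2])
# Epoch with the highest F1 score.
best_by_f1 = max(rows, key=lambda r: r[4])

print("lowest validation loss: epoch", best_by_loss[0])
print("highest F1: epoch", best_by_f1[0])
```

The two criteria disagree here, which is worth keeping in mind when reading the headline metrics: they correspond to the final epoch, not the epoch with the lowest validation loss.
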
 ### Framework versions