scbtm
/

ModernBERT_wine_quality_reviews_ft

@@ -19,9 +19,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.6800
-- Accuracy: 0.6953
-- F1: 0.6945
 ## Model description
@@ -40,9 +40,9 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 2e-05
-- train_batch_size: 32
-- eval_batch_size: 32
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.98) and epsilon=1e-06 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
@@ -53,27 +53,22 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss | Accuracy | F1     |
 |:-------------:|:------:|:----:|:---------------:|:--------:|:------:|
-| 1.2688        | 0.0590 | 250  | 1.1315          | 0.4781   | 0.4463 |
-| 1.0574        | 0.1181 | 500  | 0.9664          | 0.5575   | 0.5412 |
-| 0.9229        | 0.1771 | 750  | 0.8647          | 0.6070   | 0.6007 |
-| 0.8654        | 0.2361 | 1000 | 0.8665          | 0.6089   | 0.5922 |
-| 0.8229        | 0.2952 | 1250 | 0.7857          | 0.6448   | 0.6448 |
-| 0.8054        | 0.3542 | 1500 | 0.8515          | 0.6218   | 0.5993 |
-| 0.786         | 0.4132 | 1750 | 0.7533          | 0.6601   | 0.6552 |
-| 0.781         | 0.4723 | 2000 | 0.8133          | 0.6305   | 0.6278 |
-| 0.7563        | 0.5313 | 2250 | 0.7770          | 0.6480   | 0.6473 |
-| 0.7638        | 0.5903 | 2500 | 0.7248          | 0.6767   | 0.6769 |
-| 0.7384        | 0.6494 | 2750 | 0.7520          | 0.6597   | 0.6574 |
-| 0.7405        | 0.7084 | 3000 | 0.7615          | 0.6545   | 0.6515 |
-| 0.7222        | 0.7674 | 3250 | 0.7191          | 0.6790   | 0.6716 |
-| 0.7184        | 0.8264 | 3500 | 0.7037          | 0.6862   | 0.6837 |
-| 0.6984        | 0.8855 | 3750 | 0.7264          | 0.6716   | 0.6678 |
-| 0.6995        | 0.9445 | 4000 | 0.7455          | 0.6663   | 0.6646 |
-| 0.713         | 1.0035 | 4250 | 0.7294          | 0.6752   | 0.6701 |
-| 0.6508        | 1.0626 | 4500 | 0.6938          | 0.6872   | 0.6871 |
-| 0.642         | 1.1216 | 4750 | 0.7266          | 0.6716   | 0.6691 |
-| 0.635         | 1.1806 | 5000 | 0.6868          | 0.6913   | 0.6900 |
-| 0.6278        | 1.2397 | 5250 | 0.6800          | 0.6953   | 0.6945 |
 ### Framework versions

 This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.8255
+- Accuracy: 0.6865
+- F1: 0.6873
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 8e-05
+- train_batch_size: 64
+- eval_batch_size: 64
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.98) and epsilon=1e-06 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 | Training Loss | Epoch  | Step | Validation Loss | Accuracy | F1     |
 |:-------------:|:------:|:----:|:---------------:|:--------:|:------:|
+| 1.0765        | 0.1653 | 350  | 0.8973          | 0.5849   | 0.5797 |
+| 0.848         | 0.3305 | 700  | 0.7721          | 0.6516   | 0.6483 |
+| 0.7796        | 0.4958 | 1050 | 0.7682          | 0.6466   | 0.6470 |
+| 0.7671        | 0.6610 | 1400 | 0.7448          | 0.6611   | 0.6566 |
+| 0.7434        | 0.8263 | 1750 | 0.7378          | 0.6643   | 0.6634 |
+| 0.7232        | 0.9915 | 2100 | 0.7086          | 0.6789   | 0.6736 |
+| 0.653         | 1.1568 | 2450 | 0.7150          | 0.6768   | 0.6764 |
+| 0.6312        | 1.3220 | 2800 | 0.7119          | 0.6785   | 0.6761 |
+| 0.6298        | 1.4873 | 3150 | 0.6982          | 0.6879   | 0.6843 |
+| 0.6307        | 1.6525 | 3500 | 0.7072          | 0.6863   | 0.6864 |
+| 0.6338        | 1.8178 | 3850 | 0.6950          | 0.6862   | 0.6813 |
+| 0.6252        | 1.9830 | 4200 | 0.6996          | 0.6850   | 0.6853 |
+| 0.4418        | 2.1483 | 4550 | 0.8353          | 0.6911   | 0.6899 |
+| 0.4016        | 2.3135 | 4900 | 0.8428          | 0.6825   | 0.6815 |
+| 0.404         | 2.4788 | 5250 | 0.8241          | 0.6824   | 0.6822 |
+| 0.404         | 2.6440 | 5600 | 0.8255          | 0.6865   | 0.6873 |
 ### Framework versions

runs/Jan21_15-19-05_f3acabd62f42/events.out.tfevents.1737476068.f3acabd62f42.35586.1 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:6292a2989249493e834794982034de07429ca1dd0e62b9f786284e407db850cb
+size 457