gokulsrinivasagan/bert_tiny_lda_100_v1_stsb

@@ -1,28 +1,13 @@
 ---
 library_name: transformers
-language:
-- en
 base_model: gokulsrinivasagan/bert_tiny_lda_100_v1
 tags:
 - generated_from_trainer
-datasets:
-- glue
 metrics:
 - spearmanr
 model-index:
 - name: bert_tiny_lda_100_v1_stsb
-  results:
-  - task:
-      name: Text Classification
-      type: text-classification
-    dataset:
-      name: GLUE STSB
-      type: glue
-      args: stsb
-    metrics:
-    - name: Spearmanr
-      type: spearmanr
-      value: .nan
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -30,12 +15,12 @@ should probably proofread and complete it, then remove this comment. -->
 # bert_tiny_lda_100_v1_stsb
-This model is a fine-tuned version of [gokulsrinivasagan/bert_tiny_lda_100_v1](https://huggingface.co/gokulsrinivasagan/bert_tiny_lda_100_v1) on the GLUE STSB dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.3547
-- Pearson: nan
-- Spearmanr: nan
-- Combined Score: nan
 ## Model description
@@ -54,7 +39,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.001
 - train_batch_size: 256
 - eval_batch_size: 256
 - seed: 10
@@ -66,20 +51,17 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Pearson | Spearmanr | Combined Score |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:---------:|:--------------:|
-| 4.3798        | 1.0   | 23   | 3.4026          | nan     | nan       | nan            |
-| 2.2683        | 2.0   | 46   | 2.7380          | nan     | nan       | nan            |
-| 2.2687        | 3.0   | 69   | 2.7826          | nan     | nan       | nan            |
-| 2.1923        | 4.0   | 92   | 2.6349          | nan     | nan       | nan            |
-| 2.2028        | 5.0   | 115  | 2.6328          | -0.0113 | -0.0109   | -0.0111        |
-| 2.1848        | 6.0   | 138  | 2.3805          | -0.0246 | -0.0228   | -0.0237        |
-| 2.194         | 7.0   | 161  | 2.8532          | nan     | nan       | nan            |
-| 2.1835        | 8.0   | 184  | 2.5613          | nan     | nan       | nan            |
-| 2.1788        | 9.0   | 207  | 2.3547          | nan     | nan       | nan            |
-| 2.1688        | 10.0  | 230  | 2.4241          | nan     | nan       | nan            |
-| 2.1963        | 11.0  | 253  | 2.6747          | nan     | nan       | nan            |
-| 2.186         | 12.0  | 276  | 2.4512          | nan     | nan       | nan            |
-| 2.186         | 13.0  | 299  | 2.4655          | nan     | nan       | nan            |
-| 2.2089        | 14.0  | 322  | 2.7380          | nan     | nan       | nan            |
 ### Framework versions

 ---
 library_name: transformers
 base_model: gokulsrinivasagan/bert_tiny_lda_100_v1
 tags:
 - generated_from_trainer
 metrics:
 - spearmanr
 model-index:
 - name: bert_tiny_lda_100_v1_stsb
+  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # bert_tiny_lda_100_v1_stsb
+This model is a fine-tuned version of [gokulsrinivasagan/bert_tiny_lda_100_v1](https://huggingface.co/gokulsrinivasagan/bert_tiny_lda_100_v1) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.7520
+- Pearson: 0.3013
+- Spearmanr: 0.2887
+- Combined Score: 0.2950
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 5e-05
 - train_batch_size: 256
 - eval_batch_size: 256
 - seed: 10
 | Training Loss | Epoch | Step | Validation Loss | Pearson | Spearmanr | Combined Score |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:---------:|:--------------:|
+| 2.8077        | 1.0   | 23   | 2.3850          | 0.0667  | 0.0654    | 0.0661         |
+| 2.042         | 2.0   | 46   | 2.6089          | 0.0834  | 0.0848    | 0.0841         |
+| 1.9254        | 3.0   | 69   | 2.3926          | 0.1539  | 0.1394    | 0.1466         |
+| 1.8381        | 4.0   | 92   | 2.5560          | 0.1744  | 0.1720    | 0.1732         |
+| 1.6974        | 5.0   | 115  | 3.0257          | 0.1812  | 0.1757    | 0.1784         |
+| 1.5776        | 6.0   | 138  | 2.3632          | 0.2350  | 0.2288    | 0.2319         |
+| 1.2951        | 7.0   | 161  | 2.4535          | 0.2594  | 0.2573    | 0.2584         |
+| 1.0896        | 8.0   | 184  | 2.5246          | 0.2652  | 0.2622    | 0.2637         |
+| 0.9372        | 9.0   | 207  | 2.9827          | 0.2716  | 0.2570    | 0.2643         |
+| 0.7915        | 10.0  | 230  | 2.6918          | 0.3056  | 0.2926    | 0.2991         |
+| 0.673         | 11.0  | 253  | 2.7520          | 0.3013  | 0.2887    | 0.2950         |
 ### Framework versions
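
The Combined Score column in the tables above is consistent with the arithmetic mean of the Pearson and Spearmanr columns, for example (0.3013 + 0.2887) / 2 = 0.2950. The sketch below, in dependency-free Python, illustrates that relationship and one plausible reason for the `nan` entries in the earlier run: a correlation coefficient is undefined when one of its inputs is constant, which can happen if a regression head collapses to a single output value. The function names are hypothetical and this is not the Trainer's actual metric code; whether constant predictions caused the `nan` values here is an assumption, not something the diff confirms.

```python
import math
from statistics import mean


def pearson(x, y):
    """Pearson correlation; returns NaN when either input has zero variance."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return math.nan  # undefined for constant input
    return cov / math.sqrt(var_x * var_y)


def ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = avg
        i = j + 1
    return result


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the rank vectors."""
    return pearson(ranks(x), ranks(y))


def combined_score(pearson_value, spearman_value):
    """Mean of the two correlations, matching the table's last column."""
    return (pearson_value + spearman_value) / 2


if __name__ == "__main__":
    labels = [0.0, 1.5, 3.0, 4.5]
    collapsed = [2.3, 2.3, 2.3, 2.3]  # hypothetical constant predictions
    print(pearson(collapsed, labels), spearman(collapsed, labels))
    print(round(combined_score(0.3013, 0.2887), 4))
```

Because NaN propagates through arithmetic, a combined score computed from an undefined correlation is also NaN, which matches the pattern of rows where all three metric columns read `nan` together.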

logs/events.out.tfevents.1733327310.ki-g0008.1208741.30 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0c8f02e0d60c72445747a8096b1979563aa40fab56781f2d144d1095931f5e5f
-size 10839

 version https://git-lfs.github.com/spec/v1
+oid sha256:7e42e9c55add6eaf2117140ea6892fba25d59488bfa15c62451a78f5718b2c8e
+size 12481

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:10db9716e505ee0cccad36199f98143cd47f68d94331e46a8626a58fa4fb8214
 size 131854692

 version https://git-lfs.github.com/spec/v1
+oid sha256:ae139780e7f3735ef732e566cf5d5b58cee1ee8582b88690196ab17710c5cee9
 size 131854692
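
The entries above for `model.safetensors` and the event log are Git LFS pointer files: each records a spec version, a SHA-256 object id, and a byte size rather than the file contents. As a minimal sketch, assuming a copy of the real file has been downloaded locally, the following Python checks that a file matches its pointer. The helper names and the local path in the usage note are hypothetical.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse a Git LFS pointer ("key value" per line) into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, value = line.strip().split(" ", 1)
        fields[key] = value
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def file_matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file's size and SHA-256 digest match the pointer."""
    hasher = hashlib.sha256()
    size = 0
    with open(path, "rb") as handle:
        # Read in chunks so large weight files need not fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


# Pointer text for the updated model.safetensors, copied from the diff above.
NEW_MODEL_POINTER = """
version https://git-lfs.github.com/spec/v1
oid sha256:ae139780e7f3735ef732e566cf5d5b58cee1ee8582b88690196ab17710c5cee9
size 131854692
"""
```

For example, `file_matches_pointer("model.safetensors", parse_lfs_pointer(NEW_MODEL_POINTER))` would return `True` only for a local file with the recorded size and digest. Note that the old and new weight files have the same size (131854692 bytes) but different digests, so a size check alone would not tell them apart.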