Add SetFit ABSA model

Files changed (6)

README.md +31 -37
config.json +2 -1
config_sentence_transformers.json +2 -2
config_setfit.json +2 -2
model.safetensors +1 -1
model_head.pkl +1 -1

README.md CHANGED

@@ -6,16 +6,17 @@ tags:
 - text-classification
 - generated_from_setfit_trainer
 widget:
-- text: food:The food was bland oily.
 - text: 'soups:An oasis of refinement:  Food, though somewhat uneven, often reaches
     the pinnacles of new American fine cuisine - chef''s passion (and kitchen''s precise
     execution) is most evident in the fish dishes and soups.'
 - text: lobster sandwich:We had the lobster sandwich and it was FANTASTIC.
-- text: sommlier:I understand the area and folks you need not come here for the romantic,
-    alluring ambiance or the five star service featuring a sommlier and a complicated
-    maze of captain and back waiters - you come for the authentic foods, the tastes,
-    the experiance.
-- text: food:Not impressed with the food.
 metrics:
 - accuracy
 pipeline_tag: text-classification
@@ -34,7 +35,7 @@ model-index:
       split: test
     metrics:
     - type: accuracy
-      value: 0.8377192982456141
       name: Accuracy
 ---
@@ -59,7 +60,7 @@ This model was trained within the context of a larger system for ABSA, which loo
 - **Model Type:** SetFit
 - **Sentence Transformer body:** [sentence-transformers/all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2)
 - **Classification head:** a [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) instance
-- **spaCy Model:** en_core_web_sm
 - **SetFitABSA Aspect Model:** [ronalhung/setfit-absa-restaurants-polarity](https://huggingface.co/ronalhung/setfit-absa-restaurants-polarity)
 - **SetFitABSA Polarity Model:** [setfit-absa-polarity](https://huggingface.co/setfit-absa-polarity)
 - **Maximum Sequence Length:** 256 tokens
@@ -85,7 +86,7 @@ This model was trained within the context of a larger system for ABSA, which loo
 ### Metrics
 | Label   | Accuracy |
 |:--------|:---------|
-| **all** | 0.8377   |
 ## Uses
@@ -138,13 +139,13 @@ preds = model("The food was great, but the venue is just way too busy.")
 ## Training Details
 ### Training Set Metrics
-| Training set | Min | Median | Max |
-|:-------------|:----|:-------|:----|
-| Word count   | 4   | 18.0   | 37  |
 | Label     | Training Sample Count |
 |:----------|:----------------------|
-| no aspect | 73                    |
 | aspect    | 128                   |
 ### Training Hyperparameters
@@ -158,7 +159,7 @@ preds = model("The food was great, but the venue is just way too busy.")
 - distance_metric: cosine_distance
 - margin: 0.25
 - end_to_end: False
-- use_amp: True
 - warmup_proportion: 0.1
 - l2_weight: 0.01
 - seed: 42
@@ -168,34 +169,27 @@ preds = model("The food was great, but the venue is just way too busy.")
 ### Training Results
 | Epoch  | Step | Training Loss | Validation Loss |
 |:------:|:----:|:-------------:|:---------------:|
-| 0.0058 | 1    | 0.2702        | -               |
-| 0.2907 | 50   | 0.2764        | 0.2555          |
-| 0.5814 | 100  | 0.1827        | 0.2240          |
-| 0.8721 | 150  | 0.0239        | 0.2296          |
-| 1.1628 | 200  | 0.0027        | 0.2436          |
-| 1.4535 | 250  | 0.0015        | 0.2421          |
-| 1.7442 | 300  | 0.001         | 0.2411          |
-| 2.0349 | 350  | 0.0007        | 0.2431          |
-| 2.3256 | 400  | 0.0005        | 0.2391          |
-| 2.6163 | 450  | 0.0004        | 0.2470          |
-| 2.9070 | 500  | 0.0004        | 0.2381          |
-| 3.1977 | 550  | 0.0003        | 0.2465          |
-| 3.4884 | 600  | 0.0003        | 0.2452          |
-| 3.7791 | 650  | 0.0003        | 0.2478          |
-| 4.0698 | 700  | 0.0003        | 0.2416          |
-| 4.3605 | 750  | 0.0003        | 0.2453          |
-| 4.6512 | 800  | 0.0002        | 0.2433          |
-| 4.9419 | 850  | 0.0003        | 0.2447          |
 ### Framework Versions
-- Python: 3.11.12
 - SetFit: 1.1.2
 - Sentence Transformers: 3.4.1
-- spaCy: 3.8.5
-- Transformers: 4.51.1
-- PyTorch: 2.6.0+cu124
 - Datasets: 3.5.0
-- Tokenizers: 0.21.1
 ## Citation

 - text-classification
 - generated_from_setfit_trainer
 widget:
+- text: food:What is even better, is that the prices are very affordable as well,
+    and the food is really good.
 - text: 'soups:An oasis of refinement:  Food, though somewhat uneven, often reaches
     the pinnacles of new American fine cuisine - chef''s passion (and kitchen''s precise
     execution) is most evident in the fish dishes and soups.'
 - text: lobster sandwich:We had the lobster sandwich and it was FANTASTIC.
+- text: back waiters:I understand the area and folks you need not come here for the
+    romantic, alluring ambiance or the five star service featuring a sommlier and
+    a complicated maze of captain and back waiters - you come for the authentic foods,
+    the tastes, the experiance.
+- text: dining experience:The entire dining experience was wonderful!
 metrics:
 - accuracy
 pipeline_tag: text-classification
       split: test
     metrics:
     - type: accuracy
+      value: 0.8407079646017699
       name: Accuracy
 ---
 - **Model Type:** SetFit
 - **Sentence Transformer body:** [sentence-transformers/all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2)
 - **Classification head:** a [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) instance
+- **spaCy Model:** en_core_web_lg
 - **SetFitABSA Aspect Model:** [ronalhung/setfit-absa-restaurants-polarity](https://huggingface.co/ronalhung/setfit-absa-restaurants-polarity)
 - **SetFitABSA Polarity Model:** [setfit-absa-polarity](https://huggingface.co/setfit-absa-polarity)
 - **Maximum Sequence Length:** 256 tokens
 ### Metrics
 | Label   | Accuracy |
 |:--------|:---------|
+| **all** | 0.8407   |
 ## Uses
 ## Training Details
 ### Training Set Metrics
+| Training set | Min | Median  | Max |
+|:-------------|:----|:--------|:----|
+| Word count   | 4   | 17.9296 | 37  |
 | Label     | Training Sample Count |
 |:----------|:----------------------|
+| no aspect | 71                    |
 | aspect    | 128                   |
 ### Training Hyperparameters
 - distance_metric: cosine_distance
 - margin: 0.25
 - end_to_end: False
+- use_amp: False
 - warmup_proportion: 0.1
 - l2_weight: 0.01
 - seed: 42
 ### Training Results
 | Epoch  | Step | Training Loss | Validation Loss |
 |:------:|:----:|:-------------:|:---------------:|
+| 0.0015 | 1    | 0.3852        | -               |
+| 0.0740 | 50   | 0.2999        | 0.2943          |
+| 0.0059 | 1    | 0.2573        | -               |
+| 0.2959 | 50   | 0.2516        | 0.2540          |
+| 0.5917 | 100  | 0.1633        | 0.2385          |
+| 0.8876 | 150  | 0.015         | 0.2334          |
+| 1.1834 | 200  | 0.0025        | 0.2396          |
+| 1.4793 | 250  | 0.0013        | 0.2452          |
+| 1.7751 | 300  | 0.0009        | 0.2491          |
+| 2.0710 | 350  | 0.0007        | 0.2449          |
+| 2.3669 | 400  | 0.0005        | 0.2372          |
 ### Framework Versions
+- Python: 3.11.11
 - SetFit: 1.1.2
 - Sentence Transformers: 3.4.1
+- spaCy: 3.7.5
+- Transformers: 4.48.3
+- PyTorch: 2.5.1+cu124
 - Datasets: 3.5.0
+- Tokenizers: 0.21.0
 ## Citation

config.json CHANGED

@@ -1,4 +1,5 @@
 {
   "architectures": [
     "BertModel"
   ],
@@ -18,7 +19,7 @@
   "pad_token_id": 0,
   "position_embedding_type": "absolute",
   "torch_dtype": "float32",
-  "transformers_version": "4.51.1",
   "type_vocab_size": 2,
   "use_cache": true,
   "vocab_size": 30522

 {
+  "_name_or_path": "sentence-transformers/all-MiniLM-L6-v2",
   "architectures": [
     "BertModel"
   ],
   "pad_token_id": 0,
   "position_embedding_type": "absolute",
   "torch_dtype": "float32",
+  "transformers_version": "4.48.3",
   "type_vocab_size": 2,
   "use_cache": true,
   "vocab_size": 30522

config_sentence_transformers.json CHANGED

@@ -1,8 +1,8 @@
 {
   "__version__": {
     "sentence_transformers": "3.4.1",
-    "transformers": "4.51.1",
-    "pytorch": "2.6.0+cu124"
   },
   "prompts": {},
   "default_prompt_name": null,

 {
   "__version__": {
     "sentence_transformers": "3.4.1",
+    "transformers": "4.48.3",
+    "pytorch": "2.5.1+cu124"
   },
   "prompts": {},
   "default_prompt_name": null,

config_setfit.json CHANGED

@@ -1,9 +1,9 @@
 {
-  "spacy_model": "en_core_web_sm",
-  "normalize_embeddings": false,
   "labels": [
     "no aspect",
     "aspect"
   ],
   "span_context": 0
 }

 {
   "labels": [
     "no aspect",
     "aspect"
   ],
+  "spacy_model": "en_core_web_lg",
+  "normalize_embeddings": false,
   "span_context": 0
 }
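
The `config_setfit.json` hunk above reorders keys as well as changing a value, which makes the textual diff look larger than the semantic change. As a rough sketch (the helper name `changed_values` and the `<missing>` marker are illustrative choices, not part of SetFit), the two versions can be compared by value rather than by line:

```python
import json

# Old and new config_setfit.json contents, copied from the hunk above.
OLD_CONFIG = json.loads("""
{
  "spacy_model": "en_core_web_sm",
  "normalize_embeddings": false,
  "labels": ["no aspect", "aspect"],
  "span_context": 0
}
""")

NEW_CONFIG = json.loads("""
{
  "labels": ["no aspect", "aspect"],
  "spacy_model": "en_core_web_lg",
  "normalize_embeddings": false,
  "span_context": 0
}
""")

# Hypothetical marker for keys present on only one side.
_MISSING = "<missing>"


def changed_values(old: dict, new: dict) -> dict:
    """Return {key: (old_value, new_value)} for keys whose values differ.

    Key order is ignored, so a pure reordering yields an empty dict.
    """
    changes = {}
    for key in sorted(old.keys() | new.keys()):
        before = old.get(key, _MISSING)
        after = new.get(key, _MISSING)
        if before != after:
            changes[key] = (before, after)
    return changes


changes = changed_values(OLD_CONFIG, NEW_CONFIG)
for key, (before, after) in changes.items():
    print(f"{key}: {before!r} -> {after!r}")
```

Under this comparison the only difference is `spacy_model`, which moves from `en_core_web_sm` to `en_core_web_lg`; the other keys are unchanged apart from their position in the file.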

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:17490e38d9e2926c3362476b470910c1e2187761472f5c384bf061e009308454
 size 90864192

 version https://git-lfs.github.com/spec/v1
+oid sha256:d7f22811b51b41fc1dc5941afe286dcc3db2bfdb45833898b8a67440c38d48df
 size 90864192

model_head.pkl CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9ebbda54c87e2fc24dad2b6036b5b0d12101285f1d5a2b3a2006b17eca921c5c
 size 3919

 version https://git-lfs.github.com/spec/v1
+oid sha256:bbed4aa053d3a68620daf4411fe1596e6deee17a02f31cb5da5ef55ec0a34a8a
 size 3919
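
The `model.safetensors` and `model_head.pkl` entries above are Git LFS pointer files: each records a spec version, a `sha256` object id, and a byte size, while the binary itself lives in LFS storage. In both cases the size is unchanged and only the `oid` differs. A minimal sketch of checking a downloaded file against such a pointer follows; the function names `parse_lfs_pointer` and `matches_pointer` are hypothetical helpers, not part of any Git LFS tooling.

```python
import hashlib

# New-side pointer for model_head.pkl, copied from the diff above
# (without the leading "+" diff marker).
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:bbed4aa053d3a68620daf4411fe1596e6deee17a02f31cb5da5ef55ec0a34a8a
size 3919
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split a pointer file into version, hash algorithm, digest, and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: str, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Hash the file in chunks and compare its size and digest to the pointer."""
    hasher = hashlib.new(pointer["algo"])
    size = 0
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER_TEXT)
```

With a local copy of the file, `matches_pointer("model_head.pkl", pointer)` would report whether the download corresponds to this commit; the path here is an assumption about where the file was saved.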