Add SetFit ABSA model


Files changed (6)

README.md +49 -49
config.json +1 -2
config_sentence_transformers.json +2 -2
config_setfit.json +4 -6
model.safetensors +1 -1
model_head.pkl +2 -2

README.md CHANGED

@@ -6,19 +6,20 @@ tags:
 - text-classification
 - generated_from_setfit_trainer
 widget:
-- text: '- they use fresh mozzarella instead of the:The pizza is delicious - they
-    use fresh mozzarella instead of the cheap, frozen, shredded cheese common to most
-    pizzaria''s.'
-- text: 'refinement:  Food, though somewhat:An oasis of refinement:  Food, though
-    somewhat uneven, often reaches the pinnacles of new American fine cuisine - chef''s
-    passion (and kitchen''s precise execution) is most evident in the fish dishes
-    and soups.'
-- text: We had the lobster sandwich and it was:We had the lobster sandwich and it
-    was FANTASTIC.
-- text: The fish is fresh but:The fish is fresh but the variety of fish is nothing
-    out of ordinary.
-- text: with classic upscale Italian decor.:Nice restaurant overall, with classic
-    upscale Italian decor.
 metrics:
 - accuracy
 pipeline_tag: text-classification
@@ -26,7 +27,7 @@ library_name: setfit
 inference: false
 base_model: sentence-transformers/all-MiniLM-L6-v2
 model-index:
-- name: SetFit Polarity Model with sentence-transformers/all-MiniLM-L6-v2
   results:
   - task:
       type: text-classification
@@ -37,13 +38,13 @@ model-index:
       split: test
     metrics:
     - type: accuracy
-      value: 0.7244094488188977
       name: Accuracy
 ---
-# SetFit Polarity Model with sentence-transformers/all-MiniLM-L6-v2
-This is a [SetFit](https://github.com/huggingface/setfit) model that can be used for Aspect Based Sentiment Analysis (ABSA). This SetFit model uses [sentence-transformers/all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2) as the Sentence Transformer embedding model. A [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) instance is used for classification. In particular, this model is in charge of classifying aspect polarities.
 The model has been trained using an efficient few-shot learning technique that involves:
@@ -53,8 +54,8 @@ The model has been trained using an efficient few-shot learning technique that i
 This model was trained within the context of a larger system for ABSA, which looks like so:
 1. Use a spaCy model to select possible aspect span candidates.
-2. Use a SetFit model to filter these possible aspect span candidates.
-3. **Use this SetFit model to classify the filtered aspect span candidates.**
 ## Model Details
@@ -63,10 +64,10 @@ This model was trained within the context of a larger system for ABSA, which loo
 - **Sentence Transformer body:** [sentence-transformers/all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2)
 - **Classification head:** a [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) instance
 - **spaCy Model:** en_core_web_lg
-- **SetFitABSA Aspect Model:** [setfit-absa-aspect](https://huggingface.co/setfit-absa-aspect)
-- **SetFitABSA Polarity Model:** [ronalhung/setfit-absa-restaurants-aspect](https://huggingface.co/ronalhung/setfit-absa-restaurants-aspect)
 - **Maximum Sequence Length:** 256 tokens
-- **Number of Classes:** 4 classes
 <!-- - **Training Dataset:** [Unknown](https://huggingface.co/datasets/unknown) -->
 <!-- - **Language:** Unknown -->
 <!-- - **License:** Unknown -->
@@ -78,19 +79,17 @@ This model was trained within the context of a larger system for ABSA, which loo
 - **Blogpost:** [SetFit: Efficient Few-Shot Learning Without Prompts](https://huggingface.co/blog/setfit)
 ### Model Labels
-| Label    | Examples                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       |
-|:---------|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
-| negative | <ul><li>'But the staff was so horrible:But the staff was so horrible to us.'</li><li>', forgot our toast, left out:They did not have mayonnaise, forgot our toast, left out ingredients (ie cheese in an omelet), below hot temperatures and the bacon was so over cooked it crumbled on the plate when you touched it.'</li><li>'did not have mayonnaise, forgot our:They did not have mayonnaise, forgot our toast, left out ingredients (ie cheese in an omelet), below hot temperatures and the bacon was so over cooked it crumbled on the plate when you touched it.'</li></ul>                                          |
-| positive | <ul><li>"factor was the food, which was:To be completely fair, the only redeeming factor was the food, which was above average, but couldn't make up for all the other deficiencies of Teodora."</li><li>"The food is uniformly exceptional:The food is uniformly exceptional, with a very capable kitchen which will proudly whip up whatever you feel like eating, whether it's on the menu or not."</li><li>"a very capable kitchen which will proudly:The food is uniformly exceptional, with a very capable kitchen which will proudly whip up whatever you feel like eating, whether it's on the menu or not."</li></ul> |
-| neutral  | <ul><li>"'s on the menu or not.:The food is uniformly exceptional, with a very capable kitchen which will proudly whip up whatever you feel like eating, whether it's on the menu or not."</li><li>'to sample both meats).:Our agreed favorite is the orrechiete with sausage and chicken (usually the waiters are kind enough to split the dish in half so you get to sample both meats).'</li><li>'to split the dish in half so:Our agreed favorite is the orrechiete with sausage and chicken (usually the waiters are kind enough to split the dish in half so you get to sample both meats).'</li></ul>                   |
-| conflict | <ul><li>'The food was delicious but:The food was delicious but do not come here on a empty stomach.'</li><li>"The service varys from day:The service varys from day to day- sometimes they're very nice, and sometimes not."</li></ul>                                                                                                                                                                                                                                                                                                                                                                                         |
 ## Evaluation
 ### Metrics
 | Label   | Accuracy |
 |:--------|:---------|
-| **all** | 0.7244   |
 ## Uses
@@ -109,8 +108,8 @@ from setfit import AbsaModel
 # Download from the 🤗 Hub
 model = AbsaModel.from_pretrained(
-    "setfit-absa-aspect",
     "ronalhung/setfit-absa-restaurants-aspect",
 )
 # Run inference
 preds = model("The food was great, but the venue is just way too busy.")
@@ -145,14 +144,12 @@ preds = model("The food was great, but the venue is just way too busy.")
 ### Training Set Metrics
 | Training set | Min | Median  | Max |
 |:-------------|:----|:--------|:----|
-| Word count   | 6   | 21.3594 | 43  |
-| Label    | Training Sample Count |
-|:---------|:----------------------|
-| conflict | 2                     |
-| negative | 19                    |
-| neutral  | 25                    |
-| positive | 82                    |
 ### Training Hyperparameters
 - batch_size: (128, 128)
@@ -175,23 +172,26 @@ preds = model("The food was great, but the venue is just way too busy.")
 ### Training Results
 | Epoch  | Step | Training Loss | Validation Loss |
 |:------:|:----:|:-------------:|:---------------:|
-| 0.0147 | 1    | 0.2915        | -               |
-| 0.7353 | 50   | 0.1973        | 0.2179          |
-| 1.4706 | 100  | 0.0533        | 0.2569          |
-| 2.2059 | 150  | 0.0209        | 0.2677          |
-| 2.9412 | 200  | 0.0095        | 0.2641          |
-| 3.6765 | 250  | 0.0042        | 0.2705          |
-| 4.4118 | 300  | 0.0031        | 0.2717          |
 ### Framework Versions
-- Python: 3.11.11
 - SetFit: 1.1.2
 - Sentence Transformers: 3.4.1
-- spaCy: 3.7.5
-- Transformers: 4.48.3
-- PyTorch: 2.5.1+cu124
 - Datasets: 3.5.0
-- Tokenizers: 0.21.0
 ## Citation

 - text-classification
 - generated_from_setfit_trainer
 widget:
+- text: food:What is even better, is that the prices are very affordable as well,
+    and the food is really good.
+- text: 'cuisine:An oasis of refinement:  Food, though somewhat uneven, often reaches
+    the pinnacles of new American fine cuisine - chef''s passion (and kitchen''s precise
+    execution) is most evident in the fish dishes and soups.'
+- text: check:Had we been stalling I could understand where they were coming from,
+    but we had been there less than an hour and they hadn't even brought us a check
+    yet!
+- text: back waiters:I understand the area and folks you need not come here for the
+    romantic, alluring ambiance or the five star service featuring a sommlier and
+    a complicated maze of captain and back waiters - you come for the authentic foods,
+    the tastes, the experiance.
+- text: wine selection:The wine selection (by the glass and bottle) is wonderful and
+    I always recommend that friends make a reservation if they're going to be in town.
 metrics:
 - accuracy
 pipeline_tag: text-classification
 inference: false
 base_model: sentence-transformers/all-MiniLM-L6-v2
 model-index:
+- name: SetFit Aspect Model with sentence-transformers/all-MiniLM-L6-v2
   results:
   - task:
       type: text-classification
       split: test
     metrics:
     - type: accuracy
+      value: 0.8539823008849557
       name: Accuracy
 ---
+# SetFit Aspect Model with sentence-transformers/all-MiniLM-L6-v2
+This is a [SetFit](https://github.com/huggingface/setfit) model that can be used for Aspect Based Sentiment Analysis (ABSA). This SetFit model uses [sentence-transformers/all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2) as the Sentence Transformer embedding model. A [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) instance is used for classification. In particular, this model is in charge of filtering aspect span candidates.
 The model has been trained using an efficient few-shot learning technique that involves:
 This model was trained within the context of a larger system for ABSA, which looks like so:
 1. Use a spaCy model to select possible aspect span candidates.
+2. **Use this SetFit model to filter these possible aspect span candidates.**
+3. Use a SetFit model to classify the filtered aspect span candidates.
 ## Model Details
 - **Sentence Transformer body:** [sentence-transformers/all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2)
 - **Classification head:** a [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) instance
 - **spaCy Model:** en_core_web_lg
+- **SetFitABSA Aspect Model:** [ronalhung/setfit-absa-restaurants-aspect](https://huggingface.co/ronalhung/setfit-absa-restaurants-aspect)
+- **SetFitABSA Polarity Model:** [ronalhung/setfit-absa-restaurants-polarity](https://huggingface.co/ronalhung/setfit-absa-restaurants-polarity)
 - **Maximum Sequence Length:** 256 tokens
+- **Number of Classes:** 2 classes
 <!-- - **Training Dataset:** [Unknown](https://huggingface.co/datasets/unknown) -->
 <!-- - **Language:** Unknown -->
 <!-- - **License:** Unknown -->
 - **Blogpost:** [SetFit: Efficient Few-Shot Learning Without Prompts](https://huggingface.co/blog/setfit)
 ### Model Labels
+| Label     | Examples                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
+|:----------|:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
+| aspect    | <ul><li>'staff:But the staff was so horrible to us.'</li><li>"food:To be completely fair, the only redeeming factor was the food, which was above average, but couldn't make up for all the other deficiencies of Teodora."</li><li>"food:The food is uniformly exceptional, with a very capable kitchen which will proudly whip up whatever you feel like eating, whether it's on the menu or not."</li></ul>                                                                                                                              |
+| no aspect | <ul><li>"factor:To be completely fair, the only redeeming factor was the food, which was above average, but couldn't make up for all the other deficiencies of Teodora."</li><li>"deficiencies:To be completely fair, the only redeeming factor was the food, which was above average, but couldn't make up for all the other deficiencies of Teodora."</li><li>"Teodora:To be completely fair, the only redeeming factor was the food, which was above average, but couldn't make up for all the other deficiencies of Teodora."</li></ul> |
 ## Evaluation
 ### Metrics
 | Label   | Accuracy |
 |:--------|:---------|
+| **all** | 0.8540   |
 ## Uses
 # Download from the 🤗 Hub
 model = AbsaModel.from_pretrained(
     "ronalhung/setfit-absa-restaurants-aspect",
+    "ronalhung/setfit-absa-restaurants-polarity",
 )
 # Run inference
 preds = model("The food was great, but the venue is just way too busy.")
 ### Training Set Metrics
 | Training set | Min | Median  | Max |
 |:-------------|:----|:--------|:----|
+| Word count   | 4   | 17.9296 | 37  |
+| Label     | Training Sample Count |
+|:----------|:----------------------|
+| no aspect | 71                    |
+| aspect    | 128                   |
 ### Training Hyperparameters
 - batch_size: (128, 128)
 ### Training Results
 | Epoch  | Step | Training Loss | Validation Loss |
 |:------:|:----:|:-------------:|:---------------:|
+| 0.0059 | 1    | 0.311         | -               |
+| 0.2959 | 50   | 0.2716        | 0.2559          |
+| 0.5917 | 100  | 0.1808        | 0.2302          |
+| 0.8876 | 150  | 0.0201        | 0.2321          |
+| 1.1834 | 200  | 0.0027        | 0.2249          |
+| 1.4793 | 250  | 0.0017        | 0.2325          |
+| 1.7751 | 300  | 0.0008        | 0.2361          |
+| 2.0710 | 350  | 0.0007        | 0.2496          |
+| 2.3669 | 400  | 0.0006        | 0.2306          |
+| 2.6627 | 450  | 0.0005        | 0.2370          |
 ### Framework Versions
+- Python: 3.11.12
 - SetFit: 1.1.2
 - Sentence Transformers: 3.4.1
+- spaCy: 3.8.5
+- Transformers: 4.51.3
+- PyTorch: 2.6.0+cu124
 - Datasets: 3.5.0
+- Tokenizers: 0.21.1
 ## Citation
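
Review note on the Training Results tables in the README diff above: the Epoch column is a fraction of one pass over the training pairs, so dividing a logged step by its epoch value gives an estimate of the steps per epoch for each run. The sketch below does that arithmetic on rows copied from the removed and added tables; the helper name `steps_per_epoch` is made up for illustration and the result is only as precise as the four-decimal rounding in the tables.

```python
# Sketch: estimate optimizer steps per epoch from (epoch, step) pairs
# copied from the Training Results tables in this diff.

def steps_per_epoch(rows):
    """Estimate steps per epoch from the last logged (epoch, step) pair."""
    epoch, step = rows[-1]
    return round(step / epoch)

# Rows from the removed (old) table.
old_rows = [(0.0147, 1), (0.7353, 50), (4.4118, 300)]
# Rows from the added (new) table.
new_rows = [(0.0059, 1), (0.2959, 50), (2.6627, 450)]

old_spe = steps_per_epoch(old_rows)
new_spe = steps_per_epoch(new_rows)
print(old_spe, new_spe)  # 68 169
```

The larger figure for the new run would be consistent with its larger training set (199 samples in the added label table versus 128 in the removed one), although the diff itself does not state how the contrastive pairs were sampled.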

config.json CHANGED

@@ -1,5 +1,4 @@
 {
-  "_name_or_path": "sentence-transformers/all-MiniLM-L6-v2",
   "architectures": [
     "BertModel"
   ],
@@ -19,7 +18,7 @@
   "pad_token_id": 0,
   "position_embedding_type": "absolute",
   "torch_dtype": "float32",
-  "transformers_version": "4.48.3",
   "type_vocab_size": 2,
   "use_cache": true,
   "vocab_size": 30522

 {
   "architectures": [
     "BertModel"
   ],
   "pad_token_id": 0,
   "position_embedding_type": "absolute",
   "torch_dtype": "float32",
+  "transformers_version": "4.51.3",
   "type_vocab_size": 2,
   "use_cache": true,
   "vocab_size": 30522

config_sentence_transformers.json CHANGED

@@ -1,8 +1,8 @@
 {
   "__version__": {
     "sentence_transformers": "3.4.1",
-    "transformers": "4.48.3",
-    "pytorch": "2.5.1+cu124"
   },
   "prompts": {},
   "default_prompt_name": null,

 {
   "__version__": {
     "sentence_transformers": "3.4.1",
+    "transformers": "4.51.3",
+    "pytorch": "2.6.0+cu124"
   },
   "prompts": {},
   "default_prompt_name": null,

config_setfit.json CHANGED

@@ -1,11 +1,9 @@
 {
   "labels": [
-    "conflict",
-    "negative",
-    "neutral",
-    "positive"
   ],
   "spacy_model": "en_core_web_lg",
-  "normalize_embeddings": false,
-  "span_context": 3
 }

 {
+  "span_context": 0,
   "labels": [
+    "no aspect",
+    "aspect"
   ],
   "spacy_model": "en_core_web_lg",
+  "normalize_embeddings": false
 }
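
Review note on `span_context`: the README examples in this commit suggest that each model input has the form `span:sentence`, where the span is widened by `span_context` tokens on each side. With the old value of 3 the examples read like `But the staff was so horrible:But the staff ...`, and with the new value of 0 they read like `staff:But the staff ...`. The sketch below reproduces that pattern. It is an illustration only, not SetFit's own code: the function name `build_span_input` is hypothetical, and whitespace tokenization is an assumption standing in for the spaCy tokens the real pipeline uses.

```python
# Sketch (not SetFit's actual implementation): build the "span:sentence"
# text that the README examples in this diff appear to follow.

def build_span_input(tokens, start, stop, span_context):
    """Join the aspect span, widened by `span_context` tokens per side,
    with the full sentence, separated by a colon."""
    lo = max(start - span_context, 0)
    hi = min(stop + span_context, len(tokens))
    return " ".join(tokens[lo:hi]) + ":" + " ".join(tokens)

# Assumption: plain whitespace tokens instead of spaCy tokens.
tokens = "But the staff was so horrible to us.".split()

# The aspect span "staff" is tokens[2:3].
aspect_input = build_span_input(tokens, 2, 3, span_context=0)
polarity_input = build_span_input(tokens, 2, 3, span_context=3)

print(aspect_input)    # staff:But the staff was so horrible to us.
print(polarity_input)  # But the staff was so horrible:But the staff was so horrible to us.
```

Both outputs match examples that appear verbatim in the README diff, the first in the added `aspect` row and the second in the removed `negative` row.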

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:95bec7361c4e0094bc34b1fed6ee352c06feb5de60a9ef6e89ab83e0f532050b
 size 90864192

 version https://git-lfs.github.com/spec/v1
+oid sha256:3b18d16aca34ce420c1788dc1b084dc9dae7d95dc3d0656b272a85c8107817d1
 size 90864192

model_head.pkl CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:bdd4039a9c00e09f3e4c90dde222d2015695c03dddf2020e01aceaca7a9b711f
-size 13287

 version https://git-lfs.github.com/spec/v1
+oid sha256:5fe8b5e479477a6226d5bce3f8d401bd7bdfa0c71b6e66857b1d6fe4ce24e75b
+size 3919
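
Review note on the LFS entries: `model.safetensors` and `model_head.pkl` appear here only as Git LFS pointer files, each holding a spec version, a SHA-256 object ID, and a byte size. A minimal sketch for reading those fields and comparing the two `model_head.pkl` pointers follows; `parse_lfs_pointer` is a hypothetical helper written for this note, not part of any Git LFS tooling.

```python
# Sketch: parse the "key value" lines of a Git LFS pointer file.

def parse_lfs_pointer(text):
    """Return the pointer fields as a dict, with `size` converted to int."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    fields["size"] = int(fields["size"])
    return fields

# Pointer contents copied from the model_head.pkl diff (old, then new).
old_head = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:bdd4039a9c00e09f3e4c90dde222d2015695c03dddf2020e01aceaca7a9b711f\n"
    "size 13287\n"
)
new_head = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:5fe8b5e479477a6226d5bce3f8d401bd7bdfa0c71b6e66857b1d6fe4ce24e75b\n"
    "size 3919\n"
)

size_delta = new_head["size"] - old_head["size"]
print(size_delta)  # -9368
```

The smaller head would be consistent with the label set shrinking from four polarity classes to two aspect classes, though the pointer files alone do not confirm the cause. The unchanged `model.safetensors` size (90864192 bytes) with a new object ID suggests the same architecture with different weights.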