dardem
/

distilbert-base-uk-cased-toxicity

Text Classification

Model card Files Files and versions Community

dardem commited on Nov 8, 2023

Commit

bf4e04b

·

1 Parent(s): fe7976c

Update README.md

Files changed (1) hide show

README.md +18 -3

README.md CHANGED Viewed

@@ -6,14 +6,29 @@ widget:
 - text: "Ти неймовірна!"
 ---
-Binary toxicity classifier for Ukrainian.
 This is the fine-tuned on the downstream task ["Geotrend/distilbert-base-uk-cased"](https://huggingface.co/Geotrend/distilbert-base-uk-cased) instance.
-The evaluation metrics are:
 **Precision**: 0.9242
 **Recall**: 0.9225
 **F1**: 0.9224
-The training and evaluation data will be clarified later.

 - text: "Ти неймовірна!"
 ---
+## Binary toxicity classifier for Ukrainian.
 This is the fine-tuned on the downstream task ["Geotrend/distilbert-base-uk-cased"](https://huggingface.co/Geotrend/distilbert-base-uk-cased) instance.
+The evaluation metrics for binary toxicity classification are:
 **Precision**: 0.9242
 **Recall**: 0.9225
 **F1**: 0.9224
+The training and evaluation data will be clarified later.
+## How to use
+```python
+from transformers import AutoTokenizer, AutoModelForSequenceClassification
+# load tokenizer and model weights
+tokenizer = AutoTokenizer.from_pretrained('dardem/distilbert-base-uk-cased-toxicity')
+model = AutoModelForSequenceClassification.from_pretrained('dardem/distilbert-base-uk-cased-toxicity')
+# prepare the input
+batch = tokenizer.encode('Ти неймовірна!', return_tensors='pt')
+# inference
+model(batch)
+```