Victor commited on
Commit
0f078ba
·
1 Parent(s): 7173f68

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +22 -1
README.md CHANGED
@@ -21,8 +21,29 @@ model-index:
21
  type: f_score
22
  value: 0.9769585253
23
  widget:
24
- - text: "Cinammaldehyde is a fragrant compound found in cinammon. Arachidic acid, also known as icosanoic acid, is a saturated fatty acid with a 20-carbon chain. Triptane is commonly used as an anti-knock additive in aviation fuels. Benzophenone is a widely used building block in organic chemistry, being the parent diarylketone. Geraniol is a monoterpenoid and an alcohol. It is the primary component of citronella oil and is a primary component of rose oil, palmarosa oil."
25
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
26
  | Feature | Description |
27
  | --- | --- |
28
  | **Name** | `en_chemner` |
 
21
  type: f_score
22
  value: 0.9769585253
23
  widget:
24
+ - text: "Cinammaldehyde is a fragrant compound found in cinammon. Icosanoic acid, is a saturated fatty acid with a 20-carbon chain. Triptane is commonly used as an anti-knock additive in aviation fuels. Benzophenone is a widely used building block in organic chemistry, being the parent diarylketone. Geraniol is a monoterpenoid and an alcohol. It is the primary component of citronella oil and is a primary component of rose oil, palmarosa oil."
25
  ---
26
+ # en_chemner: A spaCy Model for Chemical NER
27
+
28
+ ## Model Description
29
+
30
+ The `en_chemner` model is a specialized Named Entity Recognition (NER) tool designed for the field of chemistry. Built using the spaCy framework,
31
+ it identifies and classifies chemical entities within English-language texts.
32
+
33
+ ### Key Features
34
+
35
+ - **High Precision and Recall**: With a precision of 99.07% and a recall of 96.36%, the model offers highly accurate entity recognition, minimizing both false positives and false negatives.
36
+ - **Rich Label Scheme**: The model can identify a variety of chemical entities such as alcohols, aldehydes, alkanes, and more, making it versatile for different chemical analysis tasks.
37
+ - **Optimized for spaCy**: Integrated seamlessly with spaCy (>=3.6.1,<3.7.0), allowing for easy incorporation into existing spaCy pipelines and applications.
38
+ - **Extensive Vector Library**: Comes with over 514,000 unique vectors, each with 300 dimensions, providing a rich foundation for understanding and classifying chemical entities.
39
+
40
+ ### Use Cases
41
+
42
+ The `en_chemner` model is ideal for:
43
+ - **Chemical Literature Analysis**: Automatically extracting chemical entities from research papers, patents, and textbooks.
44
+ - **Data Annotation**: Assisting in the annotation of chemical databases or creating datasets for further machine learning tasks.
45
+ - **Educational Purposes**: Helping students in chemistry-related fields to identify and understand various chemical compounds and their classifications.
46
+ -
47
  | Feature | Description |
48
  | --- | --- |
49
  | **Name** | `en_chemner` |