Victor
commited on
Commit
·
0f078ba
1
Parent(s):
7173f68
Update README.md
Browse files
README.md
CHANGED
@@ -21,8 +21,29 @@ model-index:
|
|
21 |
type: f_score
|
22 |
value: 0.9769585253
|
23 |
widget:
|
24 |
-
- text: "Cinammaldehyde is a fragrant compound found in cinammon.
|
25 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
26 |
| Feature | Description |
|
27 |
| --- | --- |
|
28 |
| **Name** | `en_chemner` |
|
|
|
21 |
type: f_score
|
22 |
value: 0.9769585253
|
23 |
widget:
|
24 |
+
- text: "Cinammaldehyde is a fragrant compound found in cinammon. Icosanoic acid, is a saturated fatty acid with a 20-carbon chain. Triptane is commonly used as an anti-knock additive in aviation fuels. Benzophenone is a widely used building block in organic chemistry, being the parent diarylketone. Geraniol is a monoterpenoid and an alcohol. It is the primary component of citronella oil and is a primary component of rose oil, palmarosa oil."
|
25 |
---
|
26 |
+
# en_chemner: A spaCy Model for Chemical NER
|
27 |
+
|
28 |
+
## Model Description
|
29 |
+
|
30 |
+
The `en_chemner` model is a specialized Named Entity Recognition (NER) tool designed for the field of chemistry. Built using the spaCy framework,
|
31 |
+
it identifies and classifies chemical entities within English-language texts.
|
32 |
+
|
33 |
+
### Key Features
|
34 |
+
|
35 |
+
- **High Precision and Recall**: With a precision of 99.07% and a recall of 96.36%, the model offers highly accurate entity recognition, minimizing both false positives and false negatives.
|
36 |
+
- **Rich Label Scheme**: The model can identify a variety of chemical entities such as alcohols, aldehydes, alkanes, and more, making it versatile for different chemical analysis tasks.
|
37 |
+
- **Optimized for spaCy**: Integrated seamlessly with spaCy (>=3.6.1,<3.7.0), allowing for easy incorporation into existing spaCy pipelines and applications.
|
38 |
+
- **Extensive Vector Library**: Comes with over 514,000 unique vectors, each with 300 dimensions, providing a rich foundation for understanding and classifying chemical entities.
|
39 |
+
|
40 |
+
### Use Cases
|
41 |
+
|
42 |
+
The `en_chemner` model is ideal for:
|
43 |
+
- **Chemical Literature Analysis**: Automatically extracting chemical entities from research papers, patents, and textbooks.
|
44 |
+
- **Data Annotation**: Assisting in the annotation of chemical databases or creating datasets for further machine learning tasks.
|
45 |
+
- **Educational Purposes**: Helping students in chemistry-related fields to identify and understand various chemical compounds and their classifications.
|
46 |
+
-
|
47 |
| Feature | Description |
|
48 |
| --- | --- |
|
49 |
| **Name** | `en_chemner` |
|