Update README.md
Browse files
README.md
CHANGED
@@ -25,7 +25,7 @@ widget:
|
|
25 |
|
26 |
## Summary
|
27 |
|
28 |
-
The HIV-BERT-Coreceptor model was trained as a refinement of the HIV-BERT model (insert link) and serves to better predict HIV V3 coreceptor tropism. HIV-BERT is a model refined from the ProtBert-BFD model
|
29 |
|
30 |
## Model Description
|
31 |
|
@@ -35,7 +35,7 @@ The HIV-BERT-Coreceptor model is intended to predict the Co-receptor tropism of
|
|
35 |
|
36 |
This tool can be used as a predictor of HIV tropism from the Env-V3 loop. It can recognize both R5, X4, and dual tropic viruses natively. It should not be considered a clinical diagnostic tool.
|
37 |
|
38 |
-
This tool was trained using the Los Alamos HIV sequence dataset
|
39 |
|
40 |
## How to use
|
41 |
|
|
|
25 |
|
26 |
## Summary
|
27 |
|
28 |
+
The HIV-BERT-Coreceptor model was trained as a refinement of the HIV-BERT model (insert link) and serves to better predict HIV V3 coreceptor tropism. HIV-BERT is a model refined from the [ProtBert-BFD model](https://huggingface.co/Rostlab/prot_bert_bfd) to better fulfill HIV-centric tasks. This model was then trained using HIV V3 sequences from the [Los Alamos HIV Sequence Database](https://www.hiv.lanl.gov/content/sequence/HIV/mainpage.html), allowing even more precise prediction of V3 coreceptor tropism than the HIV-BERT model can provide.
|
29 |
|
30 |
## Model Description
|
31 |
|
|
|
35 |
|
36 |
This tool can be used as a predictor of HIV tropism from the Env-V3 loop. It can recognize both R5, X4, and dual tropic viruses natively. It should not be considered a clinical diagnostic tool.
|
37 |
|
38 |
+
This tool was trained using the [Los Alamos HIV sequence dataset](https://www.hiv.lanl.gov/content/sequence/HIV/mainpage.html). Due to the sampling nature of this database, it is predominantly composed of subtype B sequences from North America and Europe with only minor contributions of Subtype C, A, and D. Currently, there was no effort made to balance the performance across these classes. As such, one should consider refinement with additional sequences to perform well on non-B sequences.
|
39 |
|
40 |
## How to use
|
41 |
|