Update README.md
README.md
CHANGED
@@ -4,6 +4,10 @@ tags:
 - DNA
 - biology
 - genomics
+- protein
+- kmer
+- cancer
+- gleason-grade-group
 ---
 ## Project Description
 This repository contains the trained model for our paper, **Fine-tuning a Sentence Transformer for DNA & Protein tasks**, which is currently under review at BMC Bioinformatics. This model, called **simcse-dna**, is based on the original implementation of **SimCSE [1]**. The original model was adapted for DNA downstream tasks by training it on a small sample of k-mer tokens generated from the human reference genome, and it can be used to generate sentence embeddings for DNA tasks.
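The description mentions k-mer tokens generated from a genome sequence. As a rough illustration of what that preprocessing might look like, here is a minimal sketch that splits a DNA string into overlapping k-mers and joins them into a space-separated "sentence". The function name `kmer_tokenize`, the default `k=6`, and the stride of 1 are assumptions made for this example; the README does not state the k-mer size or windowing the authors used.

```python
def kmer_tokenize(sequence: str, k: int = 6, stride: int = 1) -> str:
    """Split a DNA sequence into k-mers and join them with spaces.

    k and stride are illustrative defaults (assumptions), not values
    taken from the simcse-dna paper or README.
    """
    if k <= 0 or stride <= 0:
        raise ValueError("k and stride must be positive integers")
    seq = sequence.strip().upper()
    # Slide a window of width k over the sequence, advancing by `stride`.
    kmers = [seq[i:i + k] for i in range(0, len(seq) - k + 1, stride)]
    return " ".join(kmers)


if __name__ == "__main__":
    # A 10-base sequence yields 5 overlapping 6-mers.
    print(kmer_tokenize("ACGTACGTAC", k=6))
```

The resulting string could then be passed to a sentence-embedding model as ordinary text; sequences shorter than `k` produce an empty string.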