Commit
·
e684670
1
Parent(s):
2d25fd2
Update README.md
Browse files
README.md
CHANGED
@@ -6,7 +6,7 @@ license: mit
|
|
6 |
|
7 |
This model is a full finetune of ESM-2, to illustrate how full finetuning overfits and generalizes quite poorly compared to
|
8 |
LoRA and QLoRA finetuning. This model was finetuned on the 600K dataset. We also note that on the 24GB A10 GPU, the batch size
|
9 |
-
has to be significantly smaller. To finetune a similar model, use
|
10 |
[this script](https://huggingface.co/AmelieSchreiber/esm2_t6_8M_binding_sites_finetune/blob/main/finetune.py).
|
11 |
|
12 |
## Overfitting
|
|
|
6 |
|
7 |
This model is a full finetune of ESM-2, to illustrate how full finetuning overfits and generalizes quite poorly compared to
|
8 |
LoRA and QLoRA finetuning. This model was finetuned on the 600K dataset. We also note that on the 24GB A10 GPU, the batch size
|
9 |
+
has to be significantly smaller than when using LoRA or QLoRA. To finetune a similar model, use
|
10 |
[this script](https://huggingface.co/AmelieSchreiber/esm2_t6_8M_binding_sites_finetune/blob/main/finetune.py).
|
11 |
|
12 |
## Overfitting
|