AmelieSchreiber
/

esm2_t6_8M_binding_sites_finetune_600K

Token Classification

Model card Files Files and versions Community

AmelieSchreiber commited on Oct 8, 2023

Commit

e684670

·

1 Parent(s): 2d25fd2

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -6,7 +6,7 @@ license: mit
 This model is a full finetune of ESM-2, to illustrate how full finetuning overfits and generalizes quite poorly compared to
 LoRA and QLoRA finetuning. This model was finetuned on the 600K dataset. We also note that on the 24GB A10 GPU, the batch size
-has to be significantly smaller. To finetune a similar model, use
 [this script](https://huggingface.co/AmelieSchreiber/esm2_t6_8M_binding_sites_finetune/blob/main/finetune.py).
 ## Overfitting

 This model is a full finetune of ESM-2, to illustrate how full finetuning overfits and generalizes quite poorly compared to
 LoRA and QLoRA finetuning. This model was finetuned on the 600K dataset. We also note that on the 24GB A10 GPU, the batch size
+has to be significantly smaller than when using LoRA or QLoRA. To finetune a similar model, use
 [this script](https://huggingface.co/AmelieSchreiber/esm2_t6_8M_binding_sites_finetune/blob/main/finetune.py).
 ## Overfitting