---
license: other
license_name: yi-license
license_link: LICENSE
---
Another AEZAKMI v2 finetune, this time over Yi-34B-200K-rawrr-r3, with a sequence length of 2200.

I was able to squeeze that in using Unsloth; the script I used is in this repo.

Training took around 18 hours on a local RTX 3090 Ti.

I will be uploading fp16 and exl2 versions soon. So far it seems like de-contaminating Yi worked nicely.
This LoRA goes on top of the Yi-34B-200K-rawrr1-LORA-DPO-experimental-r3 LoRA.

So first get Yi-34B-200K llamafied, merge in Yi-34B-200K-rawrr1-LORA-DPO-experimental-r3, then merge in this LoRA.
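As a rough illustration, the merge order described above could be scripted with Hugging Face `peft` along these lines. This is a sketch, not a tested recipe: the model and adapter paths below are placeholders for your local copies, not exact repo IDs, and a model this size needs substantial RAM/VRAM.

```python
import torch
from transformers import AutoModelForCausalLM
from peft import PeftModel

# 1. Load the llamafied Yi-34B-200K base model (placeholder path).
base = AutoModelForCausalLM.from_pretrained(
    "path/to/Yi-34B-200K-llamafied", torch_dtype=torch.float16
)

# 2. Merge in the rawrr DPO LoRA first.
model = PeftModel.from_pretrained(
    base, "path/to/Yi-34B-200K-rawrr1-LORA-DPO-experimental-r3"
)
model = model.merge_and_unload()

# 3. Then merge in this AEZAKMI v2 LoRA on top and save the result.
model = PeftModel.from_pretrained(model, "path/to/this-repo-lora")
model = model.merge_and_unload()
model.save_pretrained("path/to/merged-output")
```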
Credits to mlabonne (I used pieces of his Mistral fine-tuning script for dataset preparation) and to Daniel Han and Michael Han of the Unsloth AI team.

[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" alt="made with Unsloth" width="400" height="64"/>](https://github.com/unslothai/unsloth)