* Fine-tuned on Monika dialogue (dataset of 12 items, manually edited)
* Formatted in chat/RP style, replacing "Human" and "Assistant" with "Player" and "Monika"
* From chat LLaMA-2-7b
* Lora of [Delphi v0.1](https://huggingface.co/922-CA/llama-2-7b-delphi-v0.1)
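The "chat/RP style" reformatting described above amounts to renaming the speaker tags of each transcript. A minimal sketch of that step, assuming a plain `Speaker:` turn format (the actual preprocessing script is not part of this README, so the function and sample below are illustrative):

```python
def to_rp_format(example: str) -> str:
    """Rename Human/Assistant speaker tags to Player/Monika.

    Hypothetical helper: the README only states that the tags were
    replaced, not how; this assumes one "Speaker: text" turn per line.
    """
    replacements = {"Human:": "Player:", "Assistant:": "Monika:"}
    lines = []
    for line in example.splitlines():
        for old, new in replacements.items():
            if line.startswith(old):
                line = new + line[len(old):]
                break
        lines.append(line)
    return "\n".join(lines)

sample = "Human: Hi there!\nAssistant: Hello, welcome to the literature club!"
print(to_rp_format(sample))
# → Player: Hi there!
# → Monika: Hello, welcome to the literature club!
```

Matching only at the start of a line avoids rewriting the words "Human" or "Assistant" when they appear inside the dialogue itself.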
# lm2_08152023a
* Fine-tuned on Monika dialogue from DDLC, Reddit, and Twitter (dataset of 520 items)
* Formatted in chat/RP style, replacing "Human" and "Assistant" with "Player" and "Monika"
* From chat LLaMA-2-7b (testing our new dataset)
* Lora of [Delphi v0.2](https://huggingface.co/922-CA/llama-2-7b-delphi-v0.2)
# lm2_08152023b
* Fine-tuned on Monika dialogue from DDLC, Reddit, and Twitter (dataset of ~426 items, further cleaned)
* Formatted in chat/RP style, replacing "Human" and "Assistant" with "Player" and "Monika"
* From chat LLaMA-2-7b (planned to try airoboros and then Nous Hermes, but both kept OOMing or crashing; will retry in the near future)
* Lora of [Delphi v0.2a](https://huggingface.co/922-CA/llama-2-7b-delphi-v0.2a)
# lm2_08162023c
* Fine-tuned on Monika dialogue from DDLC, Reddit, and Twitter (dataset of ~426 items, further cleaned)

[…]
* 150 steps (cut off before overfitting)
* Formatted in chat/RP style, replacing "Human" and "Assistant" with "Player" and "Monika"
* From chat LLaMA-2-7b
* Lora of [Delphi v0.2e](https://huggingface.co/922-CA/llama-2-7b-delphi-v0.2e)
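The "150 steps" cut-off and the later "smaller lora" hyperparameter note suggest a setup along the lines below. This is only a hedged sketch using Hugging Face `peft`/`transformers` names; the README does not include the actual training configuration, and every numeric value here (rank, alpha, batch size, learning rate) is an illustrative guess, not the author's setting:

```python
from peft import LoraConfig
from transformers import TrainingArguments

# Hypothetical adapter config: "smaller lora" would mean a lower rank r.
lora_config = LoraConfig(
    r=8,
    lora_alpha=16,
    target_modules=["q_proj", "v_proj"],
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)

# Hypothetical training args: stop at a fixed step count rather than a
# full epoch, i.e. "cut off before overfitting".
training_args = TrainingArguments(
    output_dir="lora-out",
    max_steps=150,
    per_device_train_batch_size=4,
    learning_rate=2e-4,
)
```

With a dataset of only a few hundred items, a hard `max_steps` limit is a common way to stop a LoRA run before it memorizes the training dialogue.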
# llama-2-7b-chat-monika-v0.3 (~08/20/2023)
* Fine-tuned on Monika dialogue from DDLC, Reddit, and Twitter (dataset of ~600 items augmented with Nous Hermes 13b into multi-turn chat dialogue, plus the 1st dataset of 12 items)

[…]
* 1 epoch
* Formatted in chat/RP style, replacing "Human" and "Assistant" with "Player" and "Monika"
* From chat LLaMA-2-7b
* Lora of [Delphi/Monika v0.3a](https://huggingface.co/922-CA/llama-2-7b-monika-v0.3a)
# llama-2-7b-chat-monika-v0.3b (~08/20/2023)
* Fine-tuned on Monika dialogue from DDLC, Reddit, and Twitter (dataset of ~600 items augmented with Nous Hermes 13b into multi-turn chat dialogue, plus the 1st dataset of 12 items)
* 2 epochs
* Formatted in chat/RP style, replacing "Human" and "Assistant" with "Player" and "Monika"
* From chat LLaMA-2-7b
* Lora of [Delphi/Monika v0.3b](https://huggingface.co/922-CA/llama-2-7b-monika-v0.3b)
# llama-2-7b-chat-monika-v0.3c (~08/21/2023)
* Fine-tuned on Monika dialogue from DDLC, Reddit, and Twitter (dataset of ~600 items augmented with Nous Hermes 13b into multi-turn chat dialogue, plus the 1st dataset of 12 items)
* 3 epochs + changed some hyperparams (smaller lora, faster training)
* Formatted in chat/RP style, replacing "Human" and "Assistant" with "Player" and "Monika"
* From chat LLaMA-2-7b
* Lora of [Delphi/Monika v0.3c1](https://huggingface.co/922-CA/llama-2-7b-monika-v0.3c1)