Update README.md
README.md
CHANGED
@@ -6,14 +6,20 @@ tags: []
# Model Card for Model ID

<!-- Provide a quick summary of what the model is/does. -->
-
+This model aims to optimize QA & summarization tasks for the capstone project "Edge LLM - Reducing LLM Memory Footprint to < 2GB" at UW, sponsored by Amazon.


## Model Details

### Model Description

-
+The base model is Fighoture/Llama-2-7b-chat-shortgpt-with-angular-25-percent, which was pruned with ShortGPT by 25% (8 layers) selected according to angular distance.
+
+This model is fine-tuned on a combination of datasets, including:
+
+1. A randomly selected 2.5k sample of the ShareGPT dataset.
+2. A randomly selected 1.25k sample of the Stanford GPT-4 Alpaca dataset, which is one part of Tulu V2.
+3. A randomly selected 1.25k sample of the OpenAssistant English dataset.

This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
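For context on the pruning criterion mentioned in the description, here is a minimal sketch of ShortGPT-style layer scoring by angular distance using 🤗 transformers; the base checkpoint, calibration text, and helper function are illustrative assumptions, not the exact code used to produce this model.

```python
import math
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed base checkpoint for illustration; the real pruning run is not part of this card.
name = "meta-llama/Llama-2-7b-chat-hf"
tokenizer = AutoTokenizer.from_pretrained(name)
model = AutoModelForCausalLM.from_pretrained(name)
model.eval()

def layer_angular_distances(text):
    """Score each decoder layer by the angular distance between its input and
    output hidden states, averaged over tokens (ShortGPT-style importance)."""
    inputs = tokenizer(text, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**inputs, output_hidden_states=True).hidden_states
    scores = []
    for i in range(len(hidden) - 1):
        h_in, h_out = hidden[i][0], hidden[i + 1][0]  # (seq_len, hidden_dim)
        cos = torch.nn.functional.cosine_similarity(h_in, h_out, dim=-1)
        ang = torch.arccos(cos.clamp(-1.0, 1.0)) / math.pi  # angular distance in [0, 1]
        scores.append(ang.mean().item())
    return scores

# Layers whose output barely rotates their input are the least important, so the
# 8 lowest-scoring layers (25% of 32) become the pruning candidates.
scores = layer_angular_distances("Edge devices need small language models.")
candidates = sorted(range(len(scores)), key=scores.__getitem__)[:8]
print("layers to drop:", sorted(candidates))
```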
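The 5k-example fine-tuning mix listed above could be assembled roughly as follows with the 🤗 datasets library; the Hub repo IDs and the random seed are placeholders, since the exact exports and sampling seed are not stated in the card.

```python
from datasets import load_dataset, concatenate_datasets

SEED = 42  # assumed; the card does not state the sampling seed

# Placeholder repo IDs -- substitute the exports that were actually sampled.
# Assumes all three exports share the same column schema.
sharegpt = load_dataset("your-org/sharegpt-cleaned", split="train")
gpt4_alpaca = load_dataset("your-org/tulu-v2-gpt4-alpaca", split="train")
oasst_en = load_dataset("your-org/openassistant-english", split="train")

mix = concatenate_datasets([
    sharegpt.shuffle(seed=SEED).select(range(2500)),     # 2.5k ShareGPT
    gpt4_alpaca.shuffle(seed=SEED).select(range(1250)),  # 1.25k Stanford GPT-4 Alpaca (Tulu V2)
    oasst_en.shuffle(seed=SEED).select(range(1250)),     # 1.25k OpenAssistant (English)
]).shuffle(seed=SEED)

print(mix)  # 5,000 mixed instruction-tuning examples
```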
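The card does not yet include a usage snippet; a minimal sketch for running a QA or summarization prompt through the fine-tuned checkpoint is shown below, assuming it is published under a Hub repo ID like the placeholder used here.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder repo ID -- the fine-tuned model's actual Hub ID is not shown in this diff.
repo_id = "Fighoture/llama-2-7b-chat-shortgpt-angular-25-percent-finetuned"

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(repo_id, torch_dtype=torch.float16, device_map="auto")

prompt = "Summarize in one sentence: The capstone project compresses a 7B chat model so it can run within 2 GB of memory on edge devices."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=64, do_sample=False)
print(tokenizer.decode(output[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))
```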