Update README.md
README.md
@@ -32,7 +32,7 @@ The following table outlines the key hyperparameters used for training Romulus.
 |----------------------------------|-----------------------------------------------------------------|-----------------------------|
 | `max_seq_length`                 | Maximum sequence length for the model                           | 4096                        |
 | `load_in_4bit`                   | Whether to load the model in 4-bit precision                    | False                       |
-| `model_name`                     | Pre-trained model name from Hugging Face                        | meta-llama/Meta-Llama-3.1-8B|
+| `model_name`                     | Pre-trained model name from Hugging Face                        | meta-llama/Meta-Llama-3.1-8B-Instruct|
 | `r`                              | Rank of the LoRA adapter                                        | 128                         |
 | `lora_alpha`                     | Alpha value for the LoRA module                                 | 32                          |
 | `lora_dropout`                   | Dropout rate for LoRA layers                                    | 0                           |
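
For context, the hyperparameters in this table map directly onto a LoRA fine-tuning call. Below is a minimal sketch, assuming an Unsloth-style API (the commit does not state which training library Romulus uses; the `target_modules` list is likewise an assumption, as it does not appear in the table):

```python
# Minimal sketch wiring the table's hyperparameters together.
# Assumes Unsloth's FastLanguageModel API; target_modules is an
# assumption not taken from the README.
from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="meta-llama/Meta-Llama-3.1-8B-Instruct",  # updated value
    max_seq_length=4096,   # maximum sequence length for the model
    load_in_4bit=False,    # do not quantize weights on load
)

model = FastLanguageModel.get_peft_model(
    model,
    r=128,            # rank of the LoRA adapter
    lora_alpha=32,    # alpha value for the LoRA module
    lora_dropout=0,   # dropout rate for LoRA layers
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],  # assumed
)
```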