End of training
README.md CHANGED

@@ -34,8 +34,8 @@ datasets:
 dataset_prepared_path: FinUpTagsNoTestNoExNewCodeLlama
 val_set_size: 0
 output_dir: models/codellama34bTestL4
-
-
+lora_model_dir: models/codellama34bTestL4/checkpoint-40
+auto_resume_from_checkpoints: true
 sequence_len: 4096
 sample_packing: true
 pad_to_sequence_len: true
@@ -54,12 +54,12 @@ wandb_project: 'codellamaFeed'
 wandb_entity:
 wandb_watch:
 wandb_run_id:
-wandb_name: '34bLora4'
+wandb_name: '34bLora4'
 wandb_log_model:

 gradient_accumulation_steps: 4
 micro_batch_size: 1
-num_epochs:
+num_epochs: 8
 optimizer: adamw_torch
 lr_scheduler: cosine
 learning_rate: 0.0002
@@ -98,7 +98,7 @@ special_tokens:

 </details><br>

-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/afrias5/codellamaFeed/runs/
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/afrias5/codellamaFeed/runs/81byeenq)
 # CodeLlamaL4

 This model is a fine-tuned version of [codellama/CodeLlama-34b-Python-hf](https://huggingface.co/codellama/CodeLlama-34b-Python-hf) on the None dataset.
@@ -132,7 +132,7 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 10
-- num_epochs:
+- num_epochs: 8

 ### Training results

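The config change above points the run at an existing adapter (`lora_model_dir: models/codellama34bTestL4/checkpoint-40`) and enables `auto_resume_from_checkpoints`, so training continues from checkpoint 40 for the full 8 epochs rather than starting over. For trying out the finished adapter, a minimal inference sketch with `transformers` and `peft` follows; the adapter path is an assumption taken from the config's `output_dir` (swap in the local path or Hub repo id where the adapter actually lives), and the prompt is only illustrative.

```python
# Minimal sketch: load the CodeLlama-34B-Python base model and apply the LoRA
# adapter on top. ADAPTER_PATH is an assumption taken from the config's
# output_dir; replace it with the real adapter location (local dir or Hub repo).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

BASE_ID = "codellama/CodeLlama-34b-Python-hf"
ADAPTER_PATH = "models/codellama34bTestL4"  # assumed adapter location

tokenizer = AutoTokenizer.from_pretrained(BASE_ID)
base = AutoModelForCausalLM.from_pretrained(
    BASE_ID, torch_dtype=torch.bfloat16, device_map="auto"
)
model = PeftModel.from_pretrained(base, ADAPTER_PATH)
model.eval()

prompt = "def fibonacci(n):"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
with torch.no_grad():
    out = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```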
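For readers reproducing the run outside axolotl, the hyperparameters listed in the card (Adam betas/epsilon, cosine schedule, 10 warmup steps, 8 epochs, plus the batch settings from the config) map roughly onto Hugging Face `TrainingArguments` as sketched below. This is only an illustration of the settings; the actual run was driven by the axolotl YAML config, and model/dataset wiring is omitted.

```python
# Rough TrainingArguments equivalent of the hyperparameters in this card.
# Illustrative only; not the original training script.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="models/codellama34bTestL4",
    per_device_train_batch_size=1,   # micro_batch_size: 1
    gradient_accumulation_steps=4,   # gradient_accumulation_steps: 4
    num_train_epochs=8,              # num_epochs: 8
    learning_rate=2e-4,              # learning_rate: 0.0002
    lr_scheduler_type="cosine",      # lr_scheduler: cosine
    warmup_steps=10,                 # lr_scheduler_warmup_steps: 10
    optim="adamw_torch",             # optimizer: adamw_torch
    adam_beta1=0.9,
    adam_beta2=0.999,
    adam_epsilon=1e-8,
)
```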