meta-llama/CodeLlama-7b-Instruct-hf-FaVe-rank32-2epochs-v2

Files changed (4) hide show

README.md CHANGED Viewed

@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [meta-llama/CodeLlama-7b-Instruct-hf](https://huggingface.co/meta-llama/CodeLlama-7b-Instruct-hf) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3363
 ## Model description
@@ -52,13 +52,13 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| No log        | 0.2685 | 10   | 1.4056          |
-| 1.6644        | 0.5369 | 20   | 0.6569          |
-| 1.6644        | 0.8054 | 30   | 0.5326          |
-| 0.61          | 1.0738 | 40   | 0.4391          |
-| 0.61          | 1.3423 | 50   | 0.3860          |
-| 0.46          | 1.6107 | 60   | 0.3620          |
-| 0.46          | 1.8792 | 70   | 0.3363          |
 ### Framework versions

 This model is a fine-tuned version of [meta-llama/CodeLlama-7b-Instruct-hf](https://huggingface.co/meta-llama/CodeLlama-7b-Instruct-hf) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3588
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| No log        | 0.2685 | 10   | 1.3789          |
+| 1.6221        | 0.5369 | 20   | 0.6015          |
+| 1.6221        | 0.8054 | 30   | 0.5153          |
+| 0.6128        | 1.0738 | 40   | 0.4366          |
+| 0.6128        | 1.3423 | 50   | 0.4192          |
+| 0.4311        | 1.6107 | 60   | 0.3731          |
+| 0.4311        | 1.8792 | 70   | 0.3588          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -20,8 +20,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "q_proj",
-    "v_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "v_proj",
+    "q_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b083892ca0fd38d8b27ff671d8c16a483f934fc9a54e36501444d2100dabfe9a
 size 33571624

 version https://git-lfs.github.com/spec/v1
+oid sha256:c5f6ebf8023204e8205a2854d452dfe1866257de53d55ab4dad5597832fc7274
 size 33571624

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:05071fc7106cbbca05d04e92a3d3fe2ef0aa829cf7fffd50d1babf4f54dd969f
 size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:2cfdeff385dd2dfc4278e0e4f1c65b64c87dd70f45369f818c5c4471f9e49fb5
 size 5112