amazingvince commited on
Commit
358d8e9
·
verified ·
1 Parent(s): 009ec49

Model save

Browse files
Files changed (1) hide show
  1. README.md +10 -12
README.md CHANGED
@@ -11,13 +11,13 @@ model-index:
11
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
12
  should probably proofread and complete it, then remove this comment. -->
13
 
14
- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/amazingvince/huggingface/runs/213lyzv9)
15
  # jina_embeddings_v2_base_code_multi_regression-simple
16
 
17
- This model is a fine-tuned version of [jinaai/jina-embeddings-v2-base-code](https://huggingface.co/jinaai/jina-embeddings-v2-base-code) on the amazingvince/the-stack-smol-xs-scored-and-annotated-all dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 0.6117
20
- - Mse: 0.6117
21
 
22
  ## Model description
23
 
@@ -41,22 +41,20 @@ The following hyperparameters were used during training:
41
  - eval_batch_size: 2
42
  - seed: 90085
43
  - distributed_type: multi-GPU
44
- - gradient_accumulation_steps: 16
45
- - total_train_batch_size: 32
46
  - optimizer: Adam with betas=(0.9,0.98) and epsilon=1e-09
47
  - lr_scheduler_type: linear
48
  - lr_scheduler_warmup_ratio: 0.05
49
- - num_epochs: 2.0
50
 
51
  ### Training results
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Mse |
54
  |:-------------:|:------:|:----:|:---------------:|:------:|
55
- | 0.5055 | 0.3871 | 100 | 0.6117 | 0.6117 |
56
- | 0.5288 | 0.7743 | 200 | 0.6113 | 0.6113 |
57
- | 0.5981 | 1.1614 | 300 | 0.6117 | 0.6117 |
58
- | 0.6077 | 1.5485 | 400 | 0.6125 | 0.6125 |
59
- | 0.6943 | 1.9356 | 500 | 0.6117 | 0.6117 |
60
 
61
 
62
  ### Framework versions
 
11
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
12
  should probably proofread and complete it, then remove this comment. -->
13
 
14
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/amazingvince/huggingface/runs/atvo6d3z)
15
  # jina_embeddings_v2_base_code_multi_regression-simple
16
 
17
+ This model is a fine-tuned version of [jinaai/jina-embeddings-v2-base-code](https://huggingface.co/jinaai/jina-embeddings-v2-base-code) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 0.6125
20
+ - Mse: 0.6125
21
 
22
  ## Model description
23
 
 
41
  - eval_batch_size: 2
42
  - seed: 90085
43
  - distributed_type: multi-GPU
44
+ - gradient_accumulation_steps: 32
45
+ - total_train_batch_size: 64
46
  - optimizer: Adam with betas=(0.9,0.98) and epsilon=1e-09
47
  - lr_scheduler_type: linear
48
  - lr_scheduler_warmup_ratio: 0.05
49
+ - num_epochs: 3.0
50
 
51
  ### Training results
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Mse |
54
  |:-------------:|:------:|:----:|:---------------:|:------:|
55
+ | 0.5611 | 0.7743 | 100 | 0.6137 | 0.6137 |
56
+ | 0.6542 | 1.5485 | 200 | 0.6139 | 0.6139 |
57
+ | 0.5106 | 2.3228 | 300 | 0.6125 | 0.6125 |
 
 
58
 
59
 
60
  ### Framework versions