softhell committed on
Commit 364cf35 · verified · 1 Parent(s): bb05017

codet5_base_train_e7

Files changed (3)
  1. README.md +12 -10
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -23,7 +23,7 @@ model-index:
  metrics:
  - name: Bleu
  type: bleu
- value: 0.013233021060148939
+ value: 0.01865929848556658
  ---

  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -33,8 +33,8 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [Salesforce/codet5-base](https://huggingface.co/Salesforce/codet5-base) on the code_search_net dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.9120
- - Bleu: 0.0132
+ - Loss: 0.9051
+ - Bleu: 0.0187

  ## Model description

@@ -55,25 +55,27 @@ More information needed
  The following hyperparameters were used during training:
  - learning_rate: 5e-05
  - train_batch_size: 16
- - eval_batch_size: 16
+ - eval_batch_size: 8
  - seed: 42
  - gradient_accumulation_steps: 4
  - total_train_batch_size: 64
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: linear
  - lr_scheduler_warmup_ratio: 0.1
- - num_epochs: 5
+ - num_epochs: 7
  - mixed_precision_training: Native AMP

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss | Bleu |
  |:-------------:|:------:|:----:|:---------------:|:------:|
- | 1.0906 | 1.0 | 1002 | 0.9715 | 0.0107 |
- | 0.9922 | 2.0 | 2004 | 0.9390 | 0.0108 |
- | 0.9325 | 3.0 | 3006 | 0.9233 | 0.0113 |
- | 0.8936 | 4.0 | 4008 | 0.9134 | 0.0124 |
- | 0.8769 | 4.9953 | 5005 | 0.9120 | 0.0132 |
+ | 1.111 | 1.0 | 1002 | 0.9781 | 0.0108 |
+ | 0.998 | 2.0 | 2004 | 0.9397 | 0.0109 |
+ | 0.9295 | 3.0 | 3006 | 0.9204 | 0.0120 |
+ | 0.8814 | 4.0 | 4008 | 0.9088 | 0.0159 |
+ | 0.8557 | 5.0 | 5010 | 0.9064 | 0.0171 |
+ | 0.8364 | 6.0 | 6012 | 0.9055 | 0.0180 |
+ | 0.8184 | 6.9933 | 7007 | 0.9051 | 0.0187 |


  ### Framework versions
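The numbers in the README.md change can be cross-checked with a little arithmetic. The sketch below is illustrative only: `num_devices = 1` is an assumption (the model card does not state the device count, although 16 × 4 = 64 is consistent with it), the round-up rule in `warmup_steps` is an assumption about how the warmup ratio is turned into a step count, and the helper names are hypothetical.

```python
import math

# Values copied from the old (-) and new (+) sides of the README.md change.
old_run = {"num_epochs": 5, "final_step": 5005, "eval_loss": 0.9120, "bleu": 0.0132}
new_run = {"num_epochs": 7, "final_step": 7007, "eval_loss": 0.9051, "bleu": 0.0187}

train_batch_size = 16
gradient_accumulation_steps = 4
warmup_ratio = 0.1
num_devices = 1  # assumption: not stated in the model card


def effective_batch_size(per_device, accumulation, devices):
    """Number of examples that contribute to one optimizer step."""
    return per_device * accumulation * devices


def warmup_steps(total_steps, ratio):
    """Assumed rule: apply the ratio to the total step count and round up."""
    return math.ceil(total_steps * ratio)


total = effective_batch_size(train_batch_size, gradient_accumulation_steps, num_devices)
loss_delta = new_run["eval_loss"] - old_run["eval_loss"]
bleu_delta = new_run["bleu"] - old_run["bleu"]

print(f"effective batch size: {total}")
print(f"warmup steps, old run: {warmup_steps(old_run['final_step'], warmup_ratio)}")
print(f"warmup steps, new run: {warmup_steps(new_run['final_step'], warmup_ratio)}")
print(f"validation loss change: {loss_delta:+.4f}")
print(f"BLEU change: {bleu_delta:+.4f}")
```

Because the learning-rate schedule is linear with a warmup ratio, extending the run from 5 to 7 epochs also stretches the warmup and decay phases, so the per-epoch rows of the two runs are not expected to match exactly even with the same seed.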
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:d74d7f8df141d0f53996700750693771b6f8aedeed3981d0a9e4c922ce32460f
+ oid sha256:5d6155aa929b67c581637d2a93a789bd7c6d5f36dab36c0639158feead435206
  size 891638568
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:353600229a52d1bc40f4e2190c68b4e1e545a3595edbe90c69597e2cc3abf322
+ oid sha256:e9e6e90433f234f69fc1383db1ae79466a3171c73f1ab31cd9ddbdc3f15ba214
  size 5496
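The model.safetensors and training_args.bin entries are Git LFS pointer files: only the `oid` changes, while `size` stays the same. The `oid` is the SHA-256 of the stored file's contents, so a downloaded file can be checked against its pointer. The following is a minimal sketch using only the standard library; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, not part of any Git LFS or Hugging Face tooling.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = dict(
        line.strip().split(" ", 1) for line in text.splitlines() if line.strip()
    )
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file's byte size and SHA-256 match the pointer."""
    path = Path(path)
    if pointer["algo"] != "sha256":
        return False
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.sha256()
    with path.open("rb") as handle:
        # Read in chunks so large weight files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer text taken from the new (+) side of the model.safetensors change.
new_pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:5d6155aa929b67c581637d2a93a789bd7c6d5f36dab36c0639158feead435206\n"
    "size 891638568\n"
)
```

Calling `matches_pointer("model.safetensors", new_pointer)` on a local download would then indicate whether the file corresponds to this commit rather than its parent.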