YufeiWeng commited on
Commit
ae62c5a
·
verified ·
1 Parent(s): 4bc487b

Model save

Browse files
Files changed (1) hide show
  1. README.md +12 -15
README.md CHANGED
@@ -1,9 +1,6 @@
1
  ---
2
- license: apache-2.0
3
- base_model: microsoft/beit-base-patch16-224-pt22k-ft22k
4
  tags:
5
- - image-classification
6
- - vision
7
  - generated_from_trainer
8
  metrics:
9
  - f1
@@ -17,9 +14,9 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  # donut-base-beans
19
 
20
- This model is a fine-tuned version of [microsoft/beit-base-patch16-224-pt22k-ft22k](https://huggingface.co/microsoft/beit-base-patch16-224-pt22k-ft22k) on an unknown dataset.
21
  It achieves the following results on the evaluation set:
22
- - Loss: 0.0517
23
  - F1: 0.3303
24
 
25
  ## Model description
@@ -40,8 +37,8 @@ More information needed
40
 
41
  The following hyperparameters were used during training:
42
  - learning_rate: 3e-05
43
- - train_batch_size: 16
44
- - eval_batch_size: 16
45
  - seed: 1337
46
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
  - lr_scheduler_type: linear
@@ -49,13 +46,13 @@ The following hyperparameters were used during training:
49
 
50
  ### Training results
51
 
52
- | Training Loss | Epoch | Step | F1 | Validation Loss |
53
- |:-------------:|:-----:|:-----:|:---:|:---------------:|
54
- | 0.0672 | 1.0 | 13466 | 0.0 | 0.0696 |
55
- | 0.0612 | 2.0 | 26932 | 0.0 | 0.0588 |
56
- | 0.0496 | 3.0 | 40398 | 0.0 | 0.0556 |
57
- | 0.0443 | 4.0 | 53864 | 0.0 | 0.0535 |
58
- | 0.0683 | 5.0 | 67330 | 0.0 | 0.0517 |
59
 
60
 
61
  ### Framework versions
 
1
  ---
2
+ base_model: microsoft/dit-base-finetuned-rvlcdip
 
3
  tags:
 
 
4
  - generated_from_trainer
5
  metrics:
6
  - f1
 
14
 
15
  # donut-base-beans
16
 
17
+ This model is a fine-tuned version of [microsoft/dit-base-finetuned-rvlcdip](https://huggingface.co/microsoft/dit-base-finetuned-rvlcdip) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 0.0413
20
  - F1: 0.3303
21
 
22
  ## Model description
 
37
 
38
  The following hyperparameters were used during training:
39
  - learning_rate: 3e-05
40
+ - train_batch_size: 64
41
+ - eval_batch_size: 64
42
  - seed: 1337
43
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
44
  - lr_scheduler_type: linear
 
46
 
47
  ### Training results
48
 
49
+ | Training Loss | Epoch | Step | Validation Loss | F1 |
50
+ |:-------------:|:-----:|:-----:|:---------------:|:------:|
51
+ | 0.0584 | 1.0 | 3367 | 0.0530 | 0.3303 |
52
+ | 0.0679 | 2.0 | 6734 | 0.0460 | 0.3303 |
53
+ | 0.0508 | 3.0 | 10101 | 0.0452 | 0.3303 |
54
+ | 0.0539 | 4.0 | 13468 | 0.0435 | 0.3303 |
55
+ | 0.0568 | 5.0 | 16835 | 0.0413 | 0.3303 |
56
 
57
 
58
  ### Framework versions