End of training
Browse files
README.md
CHANGED
@@ -1,10 +1,10 @@
|
|
1 |
---
|
2 |
-
base_model: distilbert/distilgpt2
|
3 |
-
datasets:
|
4 |
-
- eli5_category
|
5 |
license: apache-2.0
|
|
|
6 |
tags:
|
7 |
- generated_from_trainer
|
|
|
|
|
8 |
model-index:
|
9 |
- name: my_awesome_eli5_clm_model
|
10 |
results: []
|
@@ -13,12 +13,14 @@ model-index:
|
|
13 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
14 |
should probably proofread and complete it, then remove this comment. -->
|
15 |
|
|
|
|
|
16 |
[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>]()
|
17 |
# my_awesome_eli5_clm_model
|
18 |
|
19 |
This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the eli5_category dataset.
|
20 |
It achieves the following results on the evaluation set:
|
21 |
-
- Loss: 3.
|
22 |
|
23 |
## Model description
|
24 |
|
@@ -49,9 +51,9 @@ The following hyperparameters were used during training:
|
|
49 |
|
50 |
| Training Loss | Epoch | Step | Validation Loss |
|
51 |
|:-------------:|:-----:|:----:|:---------------:|
|
52 |
-
| 3.
|
53 |
-
| 3.
|
54 |
-
| 3.
|
55 |
|
56 |
|
57 |
### Framework versions
|
|
|
1 |
---
|
|
|
|
|
|
|
2 |
license: apache-2.0
|
3 |
+
base_model: distilbert/distilgpt2
|
4 |
tags:
|
5 |
- generated_from_trainer
|
6 |
+
datasets:
|
7 |
+
- eli5_category
|
8 |
model-index:
|
9 |
- name: my_awesome_eli5_clm_model
|
10 |
results: []
|
|
|
13 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
14 |
should probably proofread and complete it, then remove this comment. -->
|
15 |
|
16 |
+
[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>]()
|
17 |
+
[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>]()
|
18 |
[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>]()
|
19 |
# my_awesome_eli5_clm_model
|
20 |
|
21 |
This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the eli5_category dataset.
|
22 |
It achieves the following results on the evaluation set:
|
23 |
+
- Loss: 3.8084
|
24 |
|
25 |
## Model description
|
26 |
|
|
|
51 |
|
52 |
| Training Loss | Epoch | Step | Validation Loss |
|
53 |
|:-------------:|:-----:|:----:|:---------------:|
|
54 |
+
| 3.9747 | 1.0 | 565 | 3.8206 |
|
55 |
+
| 3.8978 | 2.0 | 1130 | 3.8102 |
|
56 |
+
| 3.8566 | 3.0 | 1695 | 3.8084 |
|
57 |
|
58 |
|
59 |
### Framework versions
|
runs/Aug04_09-01-54_e8185cfad283/events.out.tfevents.1722762344.e8185cfad283.34.3
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:e1c907e363eaa7128ab14c8be29910db29efd9a35592cfaf0bfeff08181777ac
|
3 |
+
size 7021
|