Training in progress, step 99
README.md
CHANGED
@@ -58,7 +58,7 @@ Peak GPU Memory: 1.2453 GB
 ### Model Results
 | epoch | step | eval_enwikippl | eval_frwikippl | eval_loss | eval_runtime | eval_samples_per_second | eval_steps_per_second | eval_zhwikippl |
 | --- | --- | --- | --- | --- | --- | --- | --- | --- |
-| |
+| | teacher | 30.2266 | 57.3005 | | | | | 18.1903 |
 | 0 | 0 | 53288.7773 | 55702.1719 | 0.0041 | 0.0758 | 13.185 | 13.185 | 55025.875 |
 | 0.4040 | 40 | 20265.3535 | 39300.7383 | 0.0004 | 0.0554 | 18.059 | 18.059 | 53151.6875 |
 | 0.8081 | 80 | 17527.1328 | 38131.125 | 0.0004 | 0.0553 | 18.096 | 18.096 | 51728.4688 |
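The teacher row added in this hunk makes it possible to read the student perplexities relative to the teacher. The following is a minimal sketch, not part of the repository: it copies the perplexity columns from the table above, and the helper names `parse_markdown_table` and `ratios_to_teacher` are hypothetical.

```python
# Perplexity columns copied from the "Model Results" table in the hunk above.
TABLE = """\
| epoch | step | eval_enwikippl | eval_frwikippl | eval_zhwikippl |
| --- | --- | --- | --- | --- |
| | teacher | 30.2266 | 57.3005 | 18.1903 |
| 0 | 0 | 53288.7773 | 55702.1719 | 55025.875 |
| 0.4040 | 40 | 20265.3535 | 39300.7383 | 53151.6875 |
| 0.8081 | 80 | 17527.1328 | 38131.125 | 51728.4688 |
"""

PPL_COLUMNS = ["eval_enwikippl", "eval_frwikippl", "eval_zhwikippl"]


def parse_markdown_table(text):
    """Turn a pipe-delimited Markdown table into a list of row dicts (hypothetical helper)."""
    lines = [ln.strip() for ln in text.strip().splitlines()]
    # Drop the outer pipes, then split the remaining text into cells.
    header = [c.strip() for c in lines[0][1:-1].split("|")]
    rows = []
    for ln in lines[2:]:  # lines[1] is the "---" separator row
        cells = [c.strip() for c in ln[1:-1].split("|")]
        rows.append(dict(zip(header, cells)))
    return rows


def ratios_to_teacher(rows, columns):
    """For each student row, divide every perplexity by the teacher's value."""
    teacher = next(r for r in rows if r["step"] == "teacher")
    result = []
    for r in rows:
        if r is teacher:
            continue
        entry = {"step": int(r["step"])}
        for col in columns:
            entry[col] = float(r[col]) / float(teacher[col])
        result.append(entry)
    return result


rows = parse_markdown_table(TABLE)
ratios = ratios_to_teacher(rows, PPL_COLUMNS)
for entry in ratios:
    print(entry["step"], {c: round(entry[c], 1) for c in PPL_COLUMNS})
```

A ratio of 1.0 would mean the student matches the teacher on that evaluation set. The rows logged so far remain far above that on all three sets.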
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:3f3f797494202b20be88011ba13f914bde2da77d669e70660e12851df271199f
 size 248894656
runs/Aug05_22-09-56_232a0f8c3879/events.out.tfevents.1722895872.232a0f8c3879
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b199e9ff544af316aaff108dc1d0d6048f1505eaebf7a51d5a82737787b4eb25
+size 9285
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:458874cfe93c1e38b7ad9045773cf4c7e762245a64a49cd53865e60179d24ccf
 size 907106628
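The binary files in this commit are stored as Git LFS pointers: three `key value` lines giving the spec version, the object id, and the size in bytes. As a rough sketch of how such a pointer can be read, the snippet below parses the new `training_args.bin` pointer shown above. The function name `parse_lfs_pointer` is hypothetical and is not part of any Git LFS tooling.

```python
# New pointer contents for training_args.bin, copied from the hunk above.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:458874cfe93c1e38b7ad9045773cf4c7e762245a64a49cd53865e60179d24ccf
size 907106628
"""


def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into a dict (hypothetical helper)."""
    fields = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        # Each line is "<key> <value>", separated by the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


info = parse_lfs_pointer(POINTER)
print(info["hash_algo"], info["digest"][:12], info["size"])
```

Because the pointer records only a hash and a size, an unchanged `size` alongside a changed `oid` (as in the `model.safetensors` and `training_args.bin` hunks) means the file contents changed while the byte length stayed the same.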