End of training
README.md
CHANGED
@@ -164,7 +164,7 @@ The following hyperparameters were used during training:
 weight=0
 )
 )`
-- lr_scheduler: `<torch.optim.lr_scheduler.LambdaLR object at
+- lr_scheduler: `<torch.optim.lr_scheduler.LambdaLR object at 0x7520daf28d30>`
 - student_model_name_or_path: `None`
 - student_config_name_or_path: `None`
 - student_model_config: `{'num_hidden_layers': 15}`
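The `lr_scheduler` entry in this hunk only records an object repr, so it does not show the schedule itself. The log path in the next file entry names the settings (`learning_rate=0.0001`, `lr_scheduler_type=polynomial`, `power` 1.5, `lr_end` 2e-05, `warmup_ratio=0.1`). The sketch below shows what such a schedule could look like, assuming the common form of linear warmup followed by polynomial decay. The function name `polynomial_lr` and the step count of 1000 are hypothetical; the total number of training steps does not appear in this diff.

```python
import math


def polynomial_lr(step, total_steps, lr_init=1e-4, lr_end=2e-5,
                  power=1.5, warmup_ratio=0.1):
    """Learning rate at `step`, assuming linear warmup then polynomial decay.

    Default values are taken from the log directory name in this commit;
    `total_steps` is NOT in the source and must be supplied by the caller.
    """
    warmup_steps = math.ceil(total_steps * warmup_ratio)
    if step < warmup_steps:
        # Linear ramp from 0 up to lr_init.
        return lr_init * step / max(1, warmup_steps)
    if step > total_steps:
        # Past the end of training the rate stays at its floor.
        return lr_end
    decay_steps = max(1, total_steps - warmup_steps)
    pct_remaining = 1 - (step - warmup_steps) / decay_steps
    return (lr_init - lr_end) * pct_remaining ** power + lr_end


if __name__ == "__main__":
    TOTAL = 1000  # hypothetical step count, for illustration only
    for s in (0, 50, 100, 550, 1000):
        print(s, polynomial_lr(s, TOTAL))
```

With `power` above 1 the rate falls faster early in the decay phase than a linear schedule would, then flattens as it approaches `lr_end`.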
logs/learning_rate=0.0001, lr_scheduler_kwargs=__power___1.5___lr_end___2e-05_, lr_scheduler_type=polynomial, per_device_train_batch_size=8, warmup_ratio=0.1/events.out.tfevents.1726790010.1c1a426a2fee
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:1a68a9b1df6707a4b4a0b181cf9f325f575ddbf6c0d5d289263bc8ca0645a4f8
+size 529
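The three added lines are a Git LFS pointer rather than the event file itself: the repository stores the spec version, the SHA-256 object ID, and the byte size (529) of the real TensorBoard log. As a minimal sketch of how such a pointer can be read, the hypothetical helper `parse_lfs_pointer` below splits each `key value` line; it is an illustration, not part of Git LFS tooling.

```python
# Pointer contents copied from the added file in this commit.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:1a68a9b1df6707a4b4a0b181cf9f325f575ddbf6c0d5d289263bc8ca0645a4f8
size 529
"""


def parse_lfs_pointer(text):
    """Parse the `key value` lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Each line is a key, one space, then the value.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid is stored as "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


if __name__ == "__main__":
    print(parse_lfs_pointer(POINTER))
```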