Discussion
Collection
13 items
โข
Updated
This model is a fine-tuned version of microsoft/phi-4 on an unknown dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
Training Loss | Epoch | Step | Validation Loss |
---|---|---|---|
2.6764 | 0.2235 | 10 | 2.4496 |
2.1053 | 0.4469 | 20 | 1.9257 |
1.222 | 0.6704 | 30 | 1.0594 |
0.1878 | 0.8939 | 40 | 0.1615 |
0.1642 | 1.1117 | 50 | 0.1395 |
0.1127 | 1.3352 | 60 | 0.1343 |
0.1483 | 1.5587 | 70 | 0.1332 |
0.1342 | 1.7821 | 80 | 0.1338 |
0.1529 | 2.0 | 90 | 0.1323 |
0.1327 | 2.2235 | 100 | 0.1289 |
0.095 | 2.4469 | 110 | 0.1286 |
0.1446 | 2.6704 | 120 | 0.1304 |
0.1631 | 2.8939 | 130 | 0.1265 |
Base model
microsoft/phi-4