flan-t5-base-samsum-tiny

This model is a fine-tuned version of google/flan-t5-base on an unknown dataset (the model name suggests the SAMSum dialogue-summarization corpus, though the card does not confirm this). It achieves the following results on the evaluation set; a minimal inference sketch follows the metrics:

  • Loss: 1.5151
  • ROUGE-1: 47.2094
  • ROUGE-2: 22.8909
  • ROUGE-L: 38.9786
  • ROUGE-Lsum: 42.8894
  • Gen Len (average generated length, in tokens): 17.68
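
To try the checkpoint, the transformers summarization pipeline is the simplest route. This is a minimal sketch rather than an official snippet from the card: the repository id EdBerg/flan-t5-base-samsum-tiny is taken from the model tree below, and the dialogue is an illustrative placeholder.

```python
# Minimal inference sketch (assumed usage; not from the original card).
from transformers import pipeline

# Repository id as listed in this card's model tree.
summarizer = pipeline("summarization", model="EdBerg/flan-t5-base-samsum-tiny")

# Placeholder SAMSum-style dialogue.
dialogue = (
    "Amanda: I baked cookies. Do you want some?\n"
    "Jerry: Sure!\n"
    "Amanda: I'll bring you some tomorrow :-)"
)

print(summarizer(dialogue, max_length=30, min_length=5)[0]["summary_text"])
```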

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training (a Seq2SeqTrainingArguments sketch follows the list):

  • learning_rate: 5e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: AdamW (adamw_torch) with betas=(0.9, 0.999) and epsilon=1e-08; no additional optimizer arguments
  • lr_scheduler_type: linear
  • num_epochs: 5
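
As a rough guide to reproduction, the values above map onto transformers' Seq2SeqTrainingArguments as sketched below. Only the listed hyperparameters come from the card; the output directory and any unlisted settings are assumptions left at their defaults.

```python
# Sketch: the card's hyperparameters expressed as Seq2SeqTrainingArguments.
from transformers import Seq2SeqTrainingArguments

training_args = Seq2SeqTrainingArguments(
    output_dir="flan-t5-base-samsum-tiny",  # assumed; not stated in the card
    learning_rate=5e-05,
    per_device_train_batch_size=8,
    per_device_eval_batch_size=8,
    seed=42,
    optim="adamw_torch",                    # OptimizerNames.ADAMW_TORCH
    adam_beta1=0.9,
    adam_beta2=0.999,
    adam_epsilon=1e-08,
    lr_scheduler_type="linear",
    num_train_epochs=5,
)
```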

Training results

Training Loss   Epoch   Step   Validation Loss   ROUGE-1   ROUGE-2   ROUGE-L   ROUGE-Lsum   Gen Len
No log          1.0     13     1.5191            46.4667   22.8802   39.5714   42.4271      16.76
No log          2.0     26     1.5157            46.9566   22.5577   39.2871   42.8101      17.26
No log          3.0     39     1.5151            47.2094   22.8909   38.9786   42.8894      17.68
No log          4.0     52     1.5185            46.6053   22.5631   38.2157   42.3972      17.57
No log          5.0     65     1.5198            46.7008   22.6139   38.3914   42.6006      17.59
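
The epoch-3 row (validation loss 1.5151) matches the headline metrics at the top of the card, suggesting that checkpoint is the one reported. Scores of this form are typically computed with the evaluate library's rouge metric and scaled by 100; the sketch below uses a placeholder prediction/reference pair, since the card does not include evaluation code.

```python
# Sketch: computing ROUGE scores in the style reported above.
# The prediction/reference pair is a placeholder, not card data.
import evaluate

rouge = evaluate.load("rouge")
scores = rouge.compute(
    predictions=["Amanda baked cookies and will bring Jerry some tomorrow."],
    references=["Amanda baked cookies and will bring some to Jerry tomorrow."],
    use_stemmer=True,
)
# evaluate returns fractions in [0, 1]; the card reports them x100.
print({name: round(value * 100, 4) for name, value in scores.items()})
```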

Framework versions

  • Transformers 4.52.4
  • PyTorch 2.6.0+cu124
  • Datasets 3.6.0
  • Tokenizers 0.21.1

Model size

  • 248M params (F32 tensors, Safetensors format)

Model tree for EdBerg/flan-t5-base-samsum-tiny

  • Fine-tuned from google/flan-t5-base