distily_multi_experiment / logs /attn_loss_fn=cos, attn_weight=5, layer_mapper=layer-2, projector=orthogonal

This model has 1 file scanned as unsafe.

lapp0's picture
Training in progress, step 61875
9f73e81 verified