nileshmalpeddi/output_exaone_rank8_newdata_b4_g2_gradient2_e1_b1024 Text Generation • 2B • Updated Mar 21 • 9
nileshmalpeddi/output_exaone_rank16_newdata_b4_g4_gradient2_e1_b1024 Text Generation • 3B • Updated Mar 13 • 6
nileshmalpeddi/output_exaone_tucker_rank16_newdata_b1_g2_gradient2_e1_b512 Text Generation • 3B • Updated Mar 14 • 8
nileshmalpeddi/output_exaone_tucker_rank16_newdata_b2_g4_s2_e1_b512 Text Generation • 3B • Updated Mar 14 • 5
nileshmalpeddi/output_exaone_tucker_rank16_newdata_b1_g2_gradient1_e1_b512 Text Generation • 3B • Updated Mar 15 • 6
nileshmalpeddi/output_exaone_tucker_alllayer_rank16_newdata_b2_g4_s2_e1_b512 Text Generation • 0.5B • Updated Mar 15 • 5
nileshmalpeddi/output_exaone_cp_5data_rank16_b4_g1_s10_e1_b512 Text Generation • 0.5B • Updated Mar 16 • 5
nileshmalpeddi/output_exaone_cp_bylayer_5data_rank16_b4_g1_s2_e1_b512 Text Generation • 3B • Updated Mar 16 • 6