In the “Tensor parallelism with column linear + row Linear” diagram, are Y2_0 and Y2_1 calculated incorrectly?
Shouldn't Y1_0 @ W2_0 be
Y2_0 = [[200, 600], [800, 2400], [1400, 4200], [2000, 6000]]?
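
One way to check the partial products is a small NumPy sketch of the column linear + row linear pattern. The values of `X`, `W1`, and `W2` below are made up for illustration and are not the ones from the diagram. It also assumes two ranks, with `W1` split by columns and `W2` split by rows; substituting the diagram's matrices should show what Y2_0 and Y2_1 ought to be.

```python
import numpy as np

# Hypothetical inputs, NOT the values from the diagram.
X = np.arange(1, 9).reshape(4, 2)   # input, shape (4, 2)
W1 = np.array([[1, 2], [3, 4]])     # column-linear weight, shape (2, 2)
W2 = np.array([[5, 6], [7, 8]])     # row-linear weight, shape (2, 2)

# Column linear: each rank holds one column of W1.
W1_0, W1_1 = W1[:, :1], W1[:, 1:]
Y1_0, Y1_1 = X @ W1_0, X @ W1_1     # each of shape (4, 1)

# Row linear: each rank holds one row of W2.
W2_0, W2_1 = W2[:1, :], W2[1:, :]
Y2_0 = Y1_0 @ W2_0                  # partial output on rank 0, shape (4, 2)
Y2_1 = Y1_1 @ W2_1                  # partial output on rank 1, shape (4, 2)

# Summing the partials (the all-reduce step) must match the unsharded result.
Y2 = Y2_0 + Y2_1
assert np.array_equal(Y2, X @ W1 @ W2)

print(Y2_0)
print(Y2_1)
```

If the sum of the two partials does not equal `X @ W1 @ W2` for the diagram's values, at least one of Y2_0 and Y2_1 is wrong.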