How to understand the graph "Tensor parallelism with column linear + row Linear"

#109
by Yihel - opened

In the “Tensor parallelism with column linear + row Linear” diagram, are Y2_0 and Y2_1 calculated incorrectly?

Shouldn't Y1_0 @W2_0 be

Y2_0 = [[200,  600],
      [800, 2400],
      [1400,4200],
      [2000,6000]]
Your need to confirm your account before you can post a new comment.

Sign up or log in to comment