About attention mask.

#51
by iqdddd - opened

It might not be obvious, but BFL had some oversight during pre-training where they forgot to mask both T5 and MMDiT tokens.

Can you clarify this point? It's not obvious at all.
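
For context, here is a minimal sketch of what masking padding tokens in attention generally means. It is a generic illustration, not BFL's code or this repository's implementation: the toy scores, the `softmax` helper, and the `padding_mask` name are all hypothetical. The assumption is that a short prompt gets padded to a fixed length, and that without a mask the padding positions still receive some attention weight.

```python
import math

def softmax(scores, mask=None):
    """Softmax over a list of scores.

    Positions where mask is False get exactly zero weight.
    Assumes at least one position is kept (mask has at least one True).
    """
    if mask is None:
        mask = [True] * len(scores)
    # Subtract the max of the kept scores for numerical stability.
    peak = max(s for s, m in zip(scores, mask) if m)
    exps = [math.exp(s - peak) if m else 0.0 for s, m in zip(scores, mask)]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical toy case: one query attending to 4 text tokens,
# where the last 2 are padding added to reach a fixed sequence length.
scores = [2.0, 1.0, 0.5, 0.5]
padding_mask = [True, True, False, False]  # True = real token

unmasked = softmax(scores)             # padding tokens still get weight
masked = softmax(scores, padding_mask) # padding tokens get zero weight
```

In the unmasked case the two padding positions take a share of the attention weight away from the real tokens; with the mask applied, the weight is redistributed over the real tokens only.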
