I do not, but you could change the code to dispatch a 8 rows GEMM to the dense MFMA with 8 rows of padding and check the numbers then!
Your understanding of the dispatch logic is correct.
Rémi Ouazan Reboul
ror
AI & ML interests
None yet
Recent Activity
commented on
their
article
5 days ago
Creating custom kernels for the AMD MI300
liked
a Space
15 days ago
eustlb/transformers-audio-ci
updated
a dataset
23 days ago
huggingface/documentation-images