Support for B200s?

#7
by shriramc - opened

I'm trying to use this kernel with B200s and am running into the error:

CUDA error (/build/source/flash-attn/flash_fwd_launch_template.h:191): no kernel image is available for execution on the device

Is there a timeline for supporting this kernel on B200s?

Same error on an RTX PRO 6000 Blackwell. I think it mainly affects Blackwell GPUs.
Strangely, it runs without issue on an H200.

Maybe the error is coming from somewhere else?

FlashAttention3 doesn't support Blackwell; it was only made to work on Hopper GPUs. Blackwell support will come with FlashAttention4.
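If it helps anyone confirm which case they are in, here is a rough sketch of a pre-flight check. It assumes PyTorch is installed and that Hopper reports compute capability 9.x (H100/H200 are 9.0), while B200 reports 10.0 and the RTX PRO 6000 Blackwell reports 12.0. The helper names `is_hopper` and `describe` are made up for this example and are not part of the kernel or of FlashAttention.

```python
def is_hopper(capability):
    """Return True if a (major, minor) compute capability is Hopper (9.x)."""
    major, _minor = capability
    return major == 9


def describe(capability):
    """Build a short message about whether FA3 is likely to load.

    Assumption (from this thread): FA3 kernels are built for Hopper only,
    so any other architecture is expected to hit the
    'no kernel image is available' error.
    """
    major, minor = capability
    arch = f"sm_{major}{minor}"
    if is_hopper(capability):
        return f"{arch}: Hopper, FA3 kernels should load"
    return f"{arch}: not Hopper, FA3 will likely fail with 'no kernel image'"


if __name__ == "__main__":
    try:
        import torch
    except ImportError:
        torch = None

    if torch is not None and torch.cuda.is_available():
        # get_device_capability returns a (major, minor) tuple for the device.
        print(describe(torch.cuda.get_device_capability(0)))
    else:
        print("No CUDA device visible; nothing to check")
```

Running this on the machine that throws the error should show whether the GPU is outside Hopper, in which case the message is probably about the missing architecture rather than something else in the setup.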

@mgoin is there a timeline you're aware of for FlashAttention4? Seems like this would be blocking for models that currently require FA3.
