Request to add support for gradient checkpointing

#12
by meetdoshi90 - opened

Hi,
Can you please also add support for gradient checkpointing with transformers? The current modeling class does not support it.

Adding the class attribute supports_gradient_checkpointing = True below line 203 should fix it.
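
To illustrate why that one attribute matters, here is a rough sketch of the opt-in check as I understand it. It is a simplified stand-in written in plain Python and does not import transformers; BaseModel, CurrentModel, PatchedModel and try_enable are hypothetical names used only for illustration, and the real base class does more than this.

```python
# Simplified stand-in for the opt-in check; NOT the actual transformers code.
# All class and function names here are hypothetical.

class BaseModel:
    # Opt-in flag: a subclass sets this to True to allow checkpointing.
    supports_gradient_checkpointing = False

    def __init__(self):
        self.gradient_checkpointing = False

    def gradient_checkpointing_enable(self):
        # Refuse to enable checkpointing unless the class opted in.
        if not self.supports_gradient_checkpointing:
            raise ValueError(
                f"{self.__class__.__name__} does not support gradient checkpointing."
            )
        self.gradient_checkpointing = True


class CurrentModel(BaseModel):
    # Mirrors the current modeling class: the flag is left at its default.
    pass


class PatchedModel(BaseModel):
    # Mirrors the requested change: the flag is set on the class.
    supports_gradient_checkpointing = True


def try_enable(model):
    """Return True if checkpointing could be enabled, False otherwise."""
    try:
        model.gradient_checkpointing_enable()
    except ValueError:
        return False
    return model.gradient_checkpointing
```

Depending on the transformers version, the forward pass may also need to route its layers through the checkpointing function when the flag is on, so the attribute alone might not be the whole fix.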

Thanks.
