Request to add support for gradient checkpoint
#12
by
meetdoshi90
- opened
Hi,
Can you please also add support for gradient checkpointing with transformers? The current modeling class does not support it.
Adding this supports_gradient_checkpointing = True
below line 203 should fix it.
Thanks.