Fix gradient requirements for layer methods #647

vivekmig · 2021-04-05T22:39:53Z

This updates gradient requirements to be set on layer inputs / outputs rather than original inputs, which ensures that gradient requirements are set when inputs are non-floating point (e.g. token indices). This also avoids unnecessarily requiring gradients between the input and target layer, when only layer gradients are required.

facebook-github-bot · 2021-04-06T20:54:52Z

@vivekmig has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

NarineK

LGTM! Thank you for the fix. Couple nits and questions.

NarineK · 2021-04-08T03:41:16Z

captum/_utils/gradient.py

@@ -23,7 +23,9 @@
 )


-def apply_gradient_requirements(inputs: Tuple[Tensor, ...]) -> List[bool]:
+def apply_gradient_requirements(
+    inputs: Tuple[Tensor, ...], warn: bool = False


nit: In order to support original behavior don't we want to not warn only when we know that warning is not necessary in case of layer approaches ?

Great catch, thanks! Yes, you're definitely right, meant to set the default to True

captum/_utils/gradient.py

facebook-github-bot · 2021-04-09T19:42:31Z

@vivekmig has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot · 2021-04-12T21:52:04Z

@vivekmig merged this pull request in d630574.

vivekmig added 2 commits April 5, 2021 15:37

Fixes

436a3fc

revert init

23ebbc8

facebook-github-bot added the cla signed label Apr 5, 2021

vivekmig mentioned this pull request Apr 5, 2021

Fix layer_gradient_x_activation and add logging for metrics #643

Closed

vivekmig added 2 commits April 5, 2021 15:44

Fix grad params

1b19bcb

Fix other methods

9fd348d

vivekmig changed the title ~~WIP: Fix gradient requirements for layer methods~~ Fix gradient requirements for layer methods Apr 6, 2021

vivekmig requested a review from NarineK April 6, 2021 20:54

NarineK approved these changes Apr 8, 2021

View reviewed changes

Fix default

363ce6d

facebook-github-bot closed this in d630574 Apr 12, 2021

facebook-github-bot added the Merged label Apr 12, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix gradient requirements for layer methods #647

Fix gradient requirements for layer methods #647

vivekmig commented Apr 5, 2021 •

edited

Loading

facebook-github-bot commented Apr 6, 2021

NarineK left a comment

NarineK Apr 8, 2021

vivekmig Apr 9, 2021

facebook-github-bot commented Apr 9, 2021

facebook-github-bot commented Apr 12, 2021

Fix gradient requirements for layer methods #647

Fix gradient requirements for layer methods #647

Conversation

vivekmig commented Apr 5, 2021 • edited Loading

facebook-github-bot commented Apr 6, 2021

NarineK left a comment

Choose a reason for hiding this comment

NarineK Apr 8, 2021

Choose a reason for hiding this comment

vivekmig Apr 9, 2021

Choose a reason for hiding this comment

facebook-github-bot commented Apr 9, 2021

facebook-github-bot commented Apr 12, 2021

vivekmig commented Apr 5, 2021 •

edited

Loading