Avoid calling flatten in TracInCP.self_influence for efficiency #1087
Summary:
We currently make an unnecessary call to flatten in TracInCP.self_influence at https://www.internalfb.com/code/fbsource/[90a997b1774c]/fbcode/pytorch/captum/captum/influence/_core/tracincp.py?lines=1071-1073
The purpose of those 3 lines is to sum layer_jacobian ** 2 over all dimensions besides the 0-th (batch) dimension; the remaining dimensions correspond to a specific parameter's dimensions, since layer_jacobian stores the per-example jacobian of a given parameter, so that its dimension is 1 greater than that of the parameter, to account for the different examples in the batch. However, we currently do this by flattening all dimensions of layer_jacobian except the 0-th (batch) dimension, then summing across the resulting 1st dimension. Flattening can incur overhead when the data needs to be copied, i.e. when a view cannot be returned. Because the parameters can be arbitrary, we cannot guarantee that a view is returned, so there is always the possibility of unnecessary overhead.
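For illustration, the existing computation looks roughly like the following sketch (not the exact source; the shapes and the `layer_jacobian` tensor here are made up just to show the pattern):

```python
import torch

# illustrative shapes: a batch of 8 examples and a parameter of shape (16, 3, 5),
# so layer_jacobian has shape (batch_size, *param_shape)
layer_jacobian = torch.randn(8, 16, 3, 5)

# current approach: flatten all non-batch dimensions, square, then sum over the
# resulting 1st dimension. flatten may copy the data when a view cannot be
# returned, which is the overhead this diff avoids.
per_example_sum = torch.sum(layer_jacobian.flatten(start_dim=1) ** 2, dim=1)
# per_example_sum has shape (batch_size,): one squared norm per example
```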
Therefore, this diff addresses that inefficiency by making TracInCP.self_influence not use flatten. One option is to explicitly compute layer_jacobian ** 2 and then sum over all dimensions except the 0-th. Another option is to leverage torch.linalg.norm, choosing the dim option so that the norm is taken across all dimensions except the 0-th, and then squaring the result.
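Minimal sketches of the two options (variable names are illustrative; the norm-based option is written here with torch.linalg.vector_norm, the variant of torch.linalg.norm that accepts an arbitrary tuple of dimensions):

```python
import torch

layer_jacobian = torch.randn(8, 16, 3, 5)  # (batch_size, *param_shape), illustrative
non_batch_dims = tuple(range(1, layer_jacobian.dim()))  # (1, 2, 3)

# option 1: square explicitly, then sum over every dimension except the 0-th
per_example_sum_v1 = (layer_jacobian ** 2).sum(dim=non_batch_dims)

# option 2: take the 2-norm over all non-batch dimensions, then square it
per_example_sum_v2 = torch.linalg.vector_norm(layer_jacobian, dim=non_batch_dims) ** 2

# both produce one value per example and agree with each other
assert per_example_sum_v1.shape == (8,)
assert torch.allclose(per_example_sum_v1, per_example_sum_v2)
```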
No new tests are needed. It would be interesting to benchmark the efficiency gains from this change, but that is just a nice-to-have, as we already know that both of the above alternatives are better than using flatten.
Reviewed By: 99warriors
Differential Revision: D41140914