Toy Example breaks with CUDA on compute_convergence_delta for Integrated Gradients #163

suragnair · 2019-11-07T07:43:41Z

For the toy example with cuda

model = ToyModel()
model = model.cuda()
model.eval()

input = torch.rand(2, 3).cuda()
baseline = torch.zeros(2, 3).cuda()

ig = IntegratedGradients(model)
attributions, delta = ig.attribute(input, baseline, target=0, return_convergence_delta=True)

fails with the error

~/anaconda3/envs/heterokaryon/lib/python3.7/site-packages/captum/attr/_utils/attribution.py in compute_convergence_delta(self, attributions, start_point, end_point, target, additional_forward_args)
    232         row_sums = [_sum_rows(attribution) for attribution in attributions]
    233         attr_sum = torch.tensor([sum(row_sum) for row_sum in zip(*row_sums)])
--> 234         return attr_sum - (end_point - start_point)
    235 
    236 

RuntimeError: expected device cpu and dtype Float but got device cuda:0 and dtype Float

presumably since attr_sum is not on GPU. Turning return_convergence_delta to False results in no error.

Similar issues may arise in other places, though I haven't checked.

The text was updated successfully, but these errors were encountered:

vivekmig · 2019-11-07T15:34:49Z

Hi @suragnair, thanks for pointing out this bug! We will push a fix for this soon.

Summary: Convergence delta was failing on GPUs since the total attribution tensor was not being created on the same device, this fixes that bug. Also, adds data parallel / CUDA tests for computing deltas whenever available. Issue: #163 Pull Request resolved: #169 Differential Revision: D18440711 Pulled By: vivekmig fbshipit-source-id: 5b67ba75492eb3c6933d13e1e67914bf1d3e5241

Summary: Convergence delta was failing on GPUs since the total attribution tensor was not being created on the same device, this fixes that bug. Also, adds data parallel / CUDA tests for computing deltas whenever available. Issue: pytorch#163 Pull Request resolved: pytorch#169 Differential Revision: D18440711 Pulled By: vivekmig fbshipit-source-id: 5b67ba75492eb3c6933d13e1e67914bf1d3e5241

suragnair changed the title ~~Toy Example (and others) break with CUDA on compute_convergence_delta~~ Toy Example breaks with CUDA on compute_convergence_delta Nov 7, 2019

suragnair changed the title ~~Toy Example breaks with CUDA on compute_convergence_delta~~ Toy Example breaks with CUDA on compute_convergence_delta Nov 7, 2019

suragnair changed the title ~~Toy Example breaks with CUDA on compute_convergence_delta~~ Toy Example breaks with CUDA on compute_convergence_delta for Integrated Gradients Nov 7, 2019

vivekmig self-assigned this Nov 7, 2019

vivekmig added bug Something isn't working triaged labels Nov 7, 2019

vivekmig mentioned this issue Nov 11, 2019

Fixing CUDA delta issue #169

Closed

vivekmig closed this as completed Nov 12, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Toy Example breaks with CUDA on compute_convergence_delta for Integrated Gradients #163

Toy Example breaks with CUDA on compute_convergence_delta for Integrated Gradients #163

suragnair commented Nov 7, 2019 •

edited

Loading

vivekmig commented Nov 7, 2019

Toy Example breaks with CUDA on compute_convergence_delta for Integrated Gradients #163

Toy Example breaks with CUDA on compute_convergence_delta for Integrated Gradients #163

Comments

suragnair commented Nov 7, 2019 • edited Loading

vivekmig commented Nov 7, 2019

suragnair commented Nov 7, 2019 •

edited

Loading