
Bug in *.attribute -- New tensor in gathering ignoring used device #316

Closed

kai-tub opened this issue Mar 7, 2020 · 3 comments


kai-tub commented Mar 7, 2020

Hi, first of all, thanks for providing and working on such a neat library!
I think I found a "bug" in the common.py file, which is used by most attribution methods. The problem arises when a CUDA device is used and a non-tensor target is given, for example a plain list.

I don't know whether you officially support CUDA, but as I couldn't find any hints indicating otherwise, I had been using the Saliency function without any problems on a CUDA device while passing a tensor as the target. After some refactoring, I changed the target to a plain list and the error occurred. I traced it down to the following line:
return torch.gather(output, 1, torch.tensor(target).reshape(len(output), 1))

As can be seen, a new tensor is created without any device information. A simple fix would be to look at where the output tensor lives:

device = "cuda" if output.is_cuda else "cpu"
return torch.gather(output, 1, torch.tensor(target).reshape(len(output), 1).to(device))
# EDIT alternative:
device = output.device
return torch.gather(output, 1, torch.tensor(target, device=device).reshape(len(output), 1))

(I don't know whether the first variant could cause problems when the data lies on a GPU other than the default one, as I have no experience with multiple GPUs.)

The tensor version works because the user can move it to the correct device beforehand; the existing code only reshapes it:
return torch.gather(output, 1, target.reshape(len(output), 1))
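
For illustration, a device-agnostic helper that accepts either a list or a tensor target might look roughly like this (a minimal sketch of the proposed fix; the name _gather_target is hypothetical, not Captum's actual function):

import torch

# Hypothetical helper -- a sketch of the fix, not Captum's actual code.
def _gather_target(output, target):
    if not torch.is_tensor(target):
        # Create the index tensor directly on the device of `output`,
        # so this also works when `output` lives on e.g. cuda:1.
        target = torch.tensor(target, device=output.device)
    return torch.gather(output, 1, target.reshape(len(output), 1))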

I've included a minimal example highlighting the problem:

# minimal_example.py
import torch
import torchvision
from captum.attr import Saliency

# device = "cpu" works!
device = "cuda"
model = torchvision.models.alexnet(pretrained=False)
model.to(device)
model.eval()
X = torch.rand(3, 3, 224, 224, dtype=torch.float)  # batch of 3
X = X.to(device)
y_pred = model(X)
saliency = Saliency(model)
X.requires_grad = True
grads = saliency.attribute(X, target=[0, 1, 2])  # plain list -> fails on CUDA
# working alternative:
# grads = saliency.attribute(X, target=torch.tensor([0, 1, 2]).to(device))
print(grads.shape)
# The reason is the gather call that creates a new tensor WITHOUT
# regarding the device of the model output.
@kai-tub kai-tub changed the title Bug in *.attribute -- Gathering ignoring used device Bug in *.attribute -- New tensor in gathering ignoring used device Mar 7, 2020

vivekmig commented Mar 7, 2020

Hi @kai-tub, thanks for the detailed information. You are right, this is definitely a bug! Your proposed fix of keeping the output's device looks great; would you like to create a PR making that change?


kai-tub commented Mar 7, 2020

Yes, I would like to create a PR. :)
I will take a closer look at it tomorrow.

facebook-github-bot pushed a commit that referenced this issue Mar 10, 2020
Summary:
Hey,
Here is my proposed fix for #316.
I've added a test case that uses the BaseGPUTest class and simply runs the saliency target tests again, with everything on the GPU. As I did not fully understand the `_target_batch_test_assert` function and what the requirements for the tests are, I simply copied the function for now. I assume there may be an obvious way to integrate these tests. :)

Another question:
Do we want to enforce the move if we receive a target tensor that is not on the right device?
We could print a UserWarning, similar to the one for automatically setting required gradients in gradient.py:33.
Or should the user be responsible for moving the target tensor to the correct device?

Best regards,
Kai
Pull Request resolved: #317

Reviewed By: NarineK

Differential Revision: D20371529

Pulled By: vivekmig

fbshipit-source-id: 92461d358f589bf47487d6170fac6a54e1d83123
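
For illustration, the warn-and-move option raised in the commit message could look roughly like this (a minimal sketch; the helper name and warning text are hypothetical, not Captum's actual implementation):

import warnings
import torch

# Hypothetical helper -- a sketch of the warn-and-move idea, not Captum's code.
# `device` is expected to be the model output's device (e.g. output.device).
def _move_target_to_device(target, device):
    if torch.is_tensor(target) and target.device != device:
        warnings.warn(
            "Target tensor is on {} but the model output is on {}; "
            "moving it automatically.".format(target.device, device)
        )
        target = target.to(device)
    return target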

NarineK commented Mar 12, 2020

Closing since the PR is merged! Thank you!

@NarineK NarineK closed this as completed Mar 12, 2020
NarineK pushed a commit to NarineK/captum-1 that referenced this issue Nov 19, 2020