Implementation of Auto Conjugate Gradient Attack #2028

yamamura-k · 2023-02-15T07:17:18Z

Description

I implemented a new attack method Auto Conjugate Gradient attack proposed in "Diversified Adversarial Attacks based on Conjugate Gradient Method", ICML2022. paper link(arxiv)

This implementation works poor when the batch size is greater than or equal to 2 because the way to treat the loss and step size condition is different from the original implementation. The implementation of APGD also has the same issue, and to fix this, we have to modify the wrapper class of the threat model (e.g. PyTorchClassifier). Due to this reason, we imitate the implementation of APGD to avoid large modification.

Type of change

Please check all relevant options.

Improvement (non-breaking)
Bug fix (non-breaking)
New feature (non-breaking)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

Testing

The test of ACG is the same to that of APGD because ACG is very similar to APGD.

Whether the attack works correctly (tests/attacks/evasion/test_auto_conjugate_gradient.py)

Test Configuration:

OS: ubuntu 22.04
Python version: 3.9.13
ART version or commit number: 11,126
TensorFlow / Keras / PyTorch / MXNet version: 2.10.1 / 2.10.0 / 1.13.1 / 1.8.0.post0

Checklist

My code follows the style guidelines of this project
I have performed a self-review of my own code
I have commented my code
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

Signed-off-by: yamamura-k <yamayama23bb@gmail.com>

beat-buesser · 2023-02-15T21:21:50Z

Hi @yamamura-k Thank you very much for contributing your attack to ART and congratulations for your ICML paper! I think your attack will be very useful for many users of ART.

What kind of changes would be required to the ART estimators to support batches of size 2 or larger with the best attack performance? Maybe we can add the required functionality.

codecov-commenter · 2023-02-15T21:28:11Z

Codecov Report

Merging #2028 (09281d0) into dev_1.14.0 (9b2891b) will increase coverage by 1.39%.
The diff coverage is 86.19%.

📣 This organization is not using Codecov’s GitHub App Integration. We recommend you install it so Codecov can continue to function properly for your repositories. Learn more

@@              Coverage Diff               @@
##           dev_1.14.0    #2028      +/-   ##
==============================================
+ Coverage       84.19%   85.59%   +1.39%     
==============================================
  Files             292      293       +1     
  Lines           25798    26158     +360     
  Branches         4665     4733      +68     
==============================================
+ Hits            21720    22389     +669     
+ Misses           2882     2554     -328     
- Partials         1196     1215      +19

Impacted Files	Coverage Δ
...attacks/evasion/auto_projected_gradient_descent.py	`85.76% <86.00%> (-0.52%)`	⬇️
art/attacks/evasion/auto_conjugate_gradient.py	`86.20% <86.20%> (ø)`
art/attacks/evasion/__init__.py	`98.21% <100.00%> (+0.03%)`	⬆️

... and 17 files with indirect coverage changes

art/attacks/evasion/auto_conjugate_gradient.py

+        # if self.loss_type not in self._predefined_losses:
+        #     raise ValueError("The argument loss_type has to be either {}.".format(self._predefined_losses))


yamamura-k · 2023-02-15T23:42:10Z

What kind of changes would be required to the ART estimators to support batches of size 2 or larger with the best attack performance? Maybe we can add the required functionality.

@beat-buesser
Thank you for considering adding this feature to estimator.
In order to improve performance, we would like to calculate the condition for the step size update decision for each image. Therefore, we want to use the objective function value for each image in the batch, not the reduced (average or sum) value.

beat-buesser · 2023-02-16T14:16:50Z

Hi @yamamura-k
The method compute_loss of the classification estimators provides an option to define the type of reduction on the loss values. Currently it is set to reduction="mean" resulting in the average loss of all samples in the batch. This option can be set to reduction="none" to return the loss for each sample separately. Would this solve the issue?

yamamura-k · 2023-02-16T15:34:03Z

@beat-buesser Thank you for your comment. I think your suggestion can solve my problem. I misunderstood that the reduction affects the behavior of loss_gradient method. Thank you again for your suggestion.

I will fix my implementation later. Also, I can modify the implementation of auto_projected_gradient_descent.py at the same time. If this modification is helpful to you, I will modify the implementation of auto_projected_gradient_descent.py and send another pull request.

yamamura-k · 2023-02-16T16:03:38Z

@beat-buesser Does the loss_gradient function work when the output of the loss is not scalar? I checked the implementation again, and I found that the output of the loss function class always returns the reduced value. So if the answer of the question is no, I think we have to change the implementation of loss_gradient or another related part.

beat-buesser · 2023-02-16T16:22:00Z

Hi @yamamura-k I think it would be amazing if you could upgrade your ACG and ART's APGD attacks to account for per sample step sizes! I think you are asking important questions, this is what I think:

reduction="none" would have to be applied in line 522, 523, and 555 to get per sample losses for the algorithms of ACG and APGD
Does the loss_gradient function work when the output of the loss is not scalar?
- I understand this question affects line 485 of the ACG attack and similarly APGD. You are right, the method loss_gradient calculates the gradients of the average loss of a batch. But because of the chain rules for derivatives of sum and division (averaging of the loss) the gradients backpropagated from the average loss per batch should still result in per sample gradients independent of the other samples in the batch. In addition the per sample gradients are normalized depending on the selected norm after calling loss_gradient. Based on this I think we should not require any changes to loss_gradient for ACM and upgraded APGD.

What do you think?

yamamura-k · 2023-02-16T22:47:31Z

@beat-buesser Thank you for your comments and suggestions.

Based on this I think we should not require any changes to loss_gradient for ACM and upgraded APGD.

I understand your opinion, and basically agree with you. However, the problem is how to satisfy the following conflicting requirements.

classifier._loss should return a vector of losses to get the loss values per samples.
classifier._loss should return the averaged losses to calculate the gradient.

I think reduction="none" cannot solve this problem directly because ART classifiers assume the output of loss is averaged over the batch (e.g. lines 843 to 856 in adversarial-robustness-toolbox/art/estimators/classification/pytorch.py). That is, the output of loss should be averaged in the definition of loss function class (like DifferenceLogitsRatioPyTorch) to calculate the gradient in the current implementation of loss_grad.

One possible solution is averaging the output of classifier._loss when the shape > (1, ) in loss_gradient function.
The code below is the example of this solution. The similar modification should be applied to the other files.

    loss = self._loss(model_outputs[-1], labels_t) #line 843 in adversarial-robustness-toolbox/art/estimators/classification/pytorch.py
    # My suggestion
    if len(loss.shape) == 1 and loss.shape[0] > 1:
            loss = loss.mean() # reduce the loss
    elif len(loss.shape) > 1:
            raise ValueError
    # !My suggestion
    # Clean gradients
    self._model.zero_grad()

    # Compute gradients
    if self._use_amp:  # pragma: no cover
        from apex import amp  # pylint: disable=E0611

        with amp.scale_loss(loss, self._optimizer) as scaled_loss:
            scaled_loss.backward()

    else:
        loss.backward()

Another possible solution is to pass the keyword argument reduction to the classifier._loss and reduce the loss value in classifier._loss according to reduction keyword. (currently, user-specified reduction is applied in compute_loss but it seems not to work because the classifier._loss returns the mean of loss value.)

These are what I'm thinking about. Would you tell me your opinion?

beat-buesser · 2023-02-17T00:56:05Z

Hi @yamamura-k

I think your considerations are correct and important.

ART expects classifier._loss to be the reduced loss (average, sum, etc.) across the batch as a single value.

Your last proposal of passing the keyword argument reduction to the classifier._loss is a good idea and I think we are doing this already in classifier.compute_loss in lines

adversarial-robustness-toolbox/art/estimators/classification/pytorch.py

Lines 743 to 748 in eadbe6d

    
           prev_reduction = self._loss.reduction 
        
           # Return individual loss values 
        
           self._loss.reduction = reduction 
        
           loss = self._loss(model_outputs[-1], labels_t) 
        
           self._loss.reduction = prev_reduction

where we save the original reduction option to prev_reduction and set the reduction option defined in the keyword argument with self._loss.reduction = reduction. In the next lines we revert self._loss.reduction to the original reduction option.

I have run a short test based on one of ART's example scripts which trains a PyTorchClassifier and runs/prints compute_loss with both options mean and none:

import torch.nn as nn
import torch.nn.functional as F
import torch.optim as optim
import numpy as np

from art.attacks.evasion import FastGradientMethod
from art.estimators.classification import PyTorchClassifier
from art.utils import load_mnist


# Step 0: Define the neural network model, return logits instead of activation in forward method


class Net(nn.Module):
    def __init__(self):
        super(Net, self).__init__()
        self.conv_1 = nn.Conv2d(in_channels=1, out_channels=4, kernel_size=5, stride=1)
        self.conv_2 = nn.Conv2d(in_channels=4, out_channels=10, kernel_size=5, stride=1)
        self.fc_1 = nn.Linear(in_features=4 * 4 * 10, out_features=100)
        self.fc_2 = nn.Linear(in_features=100, out_features=10)

    def forward(self, x):
        x = F.relu(self.conv_1(x))
        x = F.max_pool2d(x, 2, 2)
        x = F.relu(self.conv_2(x))
        x = F.max_pool2d(x, 2, 2)
        x = x.view(-1, 4 * 4 * 10)
        x = F.relu(self.fc_1(x))
        x = self.fc_2(x)
        return x


# Step 1: Load the MNIST dataset

(x_train, y_train), (x_test, y_test), min_pixel_value, max_pixel_value = load_mnist()

# Step 1a: Swap axes to PyTorch's NCHW format

x_train = np.transpose(x_train, (0, 3, 1, 2)).astype(np.float32)
x_test = np.transpose(x_test, (0, 3, 1, 2)).astype(np.float32)

# Step 2: Create the model

model = Net()

# Step 2a: Define the loss function and the optimizer

criterion = nn.CrossEntropyLoss()
optimizer = optim.Adam(model.parameters(), lr=0.01)

# Step 3: Create the ART classifier

classifier = PyTorchClassifier(
    model=model,
    clip_values=(min_pixel_value, max_pixel_value),
    loss=criterion,
    optimizer=optimizer,
    input_shape=(1, 28, 28),
    nb_classes=10,
)

# Step 4: Train the ART classifier

classifier.fit(x_train, y_train, batch_size=64, nb_epochs=3)

print("mean")
print(classifier.compute_loss(x=x_test[:10], y=y_test[:10], reduction="mean"))

print("none")
print(classifier.compute_loss(x=x_test[:10], y=y_test[:10], reduction="none"))

This script should print something like

mean
0.011778888
none
[ 7.0333235e-06  1.9073468e-06  2.7418098e-06  1.0013530e-05
  1.1394425e-02 -0.0000000e+00  2.2188823e-03  2.7656173e-05
  1.0411299e-01  1.3232144e-05]

Would this provide the required loss values for ACG attack?

yamamura-k · 2023-02-17T01:39:42Z

@beat-buesser Thank you for your explanation about the functionality of ART. I think the explained functionality satisfies my requirements. Then I will try my second proposal and push the commits later.

Signed-off-by: yamamura-k <yamayama23bb@gmail.com>

…able the image-wise stepsize update Signed-off-by: yamamura-k <yamayama23bb@gmail.com>

yamamura-k · 2023-02-17T03:08:59Z

@beat-buesser I modified the implementation of ACG and APGD to enable the image-wise stepsize updates in this pull request. Thank you for your patience and valuable comments and suggestions.
I confirmed that the current implementation showed the similar attack performance to the original implementation.

art/attacks/evasion/auto_projected_gradient_descent.py

Signed-off-by: yamamura-k <yamayama23bb@gmail.com>

…stness-toolbox

art/attacks/evasion/auto_conjugate_gradient.py

art/attacks/evasion/auto_projected_gradient_descent.py

beat-buesser · 2023-02-27T20:17:33Z

Hi @yamamura-k I think the proposed changes above should fix the style checks. What do you think?

Signed-off-by: yamamura-k <yamayama23bb@gmail.com>

yamamura-k · 2023-02-28T01:30:34Z

Hi @beat-buesser, the codes changed by your suggestion fix the style checks, and no new warnings are raised. Thank you so much!

Signed-off-by: yamamura-k <yamayama23bb@gmail.com>

beat-buesser · 2023-03-11T21:55:20Z

Hi @yamamura-k Thank you very much for contributing your attack Auto Conjugate Gradient Attack (ICML 2022) to ART! It will be part of ART 1.14!

yamamura-k · 2023-03-14T00:12:43Z

@beat-buesser Thank you very much for your patience in working with me to modify the code!

…I#2028 because pytorch estimators do not recognize the class created in AGPD and thus do not correctly handle the labels anymore

yamamura-k added 5 commits February 15, 2023 15:37

implement auto conjugate gradient attack

5453928

Signed-off-by: yamamura-k <yamayama23bb@gmail.com>

implement auto conjugate gradient attack

30d3352

Signed-off-by: yamamura-k <yamayama23bb@gmail.com>

implement auto conjugate gradient attack

5019f7b

Signed-off-by: yamamura-k <yamayama23bb@gmail.com>

update docs

b85a854

Signed-off-by: yamamura-k <yamayama23bb@gmail.com>

small fix

b915b03

Signed-off-by: yamamura-k <yamayama23bb@gmail.com>

beat-buesser self-requested a review February 15, 2023 20:07

beat-buesser self-assigned this Feb 15, 2023

beat-buesser added the enhancement New feature or request label Feb 15, 2023

github-advanced-security bot found potential problems Feb 15, 2023

View reviewed changes

beat-buesser added this to the ART 1.14.0 milestone Feb 16, 2023

yamamura-k added 2 commits February 17, 2023 11:45

fix codes for batchsize > 1

4ed12ae

Signed-off-by: yamamura-k <yamayama23bb@gmail.com>

modify the implementation of auto_projected_gradient_descent.py to en…

8efea2c

…able the image-wise stepsize update Signed-off-by: yamamura-k <yamayama23bb@gmail.com>

github-advanced-security bot found potential problems Feb 17, 2023

View reviewed changes

yamamura-k and others added 5 commits February 19, 2023 00:22

fix initialization of stepsize eta

c9dcc32

Signed-off-by: yamamura-k <yamayama23bb@gmail.com>

improve performance and fix bugs

80cfe5c

Signed-off-by: yamamura-k <yamayama23bb@gmail.com>

Change the default reduction type for backward to "sum".

d1a2302

Change the default reduction type for backward to "sum".

dd1be2d

Signed-off-by: yamamura-k <yamayama23bb@gmail.com>

Merge branch 'main' of https://github.com/yamamura-k/adversarial-robu…

169a799

…stness-toolbox