adding adversarial weight perturbation protocol #2224

Zaid-Hameed · 2023-07-20T00:08:10Z

Description

AWP is an important adversarial training approach because it provides better robustness against adversarial attacks and mitigates robust overfitting. AWP has been proposed in paper "Adversarial Weight Perturbation Helps
Robust Generalization".

Paper link: https://proceedings.neurips.cc/paper/2020/file/1ef91c212e30e14bf125e9374262401f-Paper.pdf

It is also a base component of more advanced adversarial training approaches.

Fixes #2164

Type of change

Please check all relevant options.

Improvement (non-breaking)
Bug fix (non-breaking)
New feature (non-breaking)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

Testing

Please describe the tests that you ran to verify your changes. Consider listing any relevant details of your test configuration.

Adversarial weight perturbation based training implementation produces results similar to original implementation
All functions in implemented code work as expected

Test Configuration:

OS: Red Hat Enterprise Linux 8.7 (Ootpa)
Python version: 3.9.12
ART version or commit number
TensorFlow / Keras / PyTorch / MXNet version: PyTorch 1.13.1+cu117

Checklist

My code follows the style guidelines of this project
I have performed a self-review of my own code
I have commented my code
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

Signed-off-by: Muhammad Zaid Hameed <Zaid.Hameed@ibm.com>

codecov-commenter · 2023-07-20T00:13:04Z

Codecov Report

Merging #2224 (0a78cdb) into dev_1.16.0 (19259d7) will increase coverage by 0.09%.
The diff coverage is 89.34%.

❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

@@              Coverage Diff               @@
##           dev_1.16.0    #2224      +/-   ##
==============================================
+ Coverage       84.76%   84.85%   +0.09%     
==============================================
  Files             313      315       +2     
  Lines           27810    28054     +244     
  Branches         5086     5123      +37     
==============================================
+ Hits            23572    23805     +233     
+ Misses           2948     2941       -7     
- Partials         1290     1308      +18

Files Changed	Coverage Δ
...efences/trainer/adversarial_trainer_awp_pytorch.py	`88.07% <88.07%> (ø)`
art/defences/trainer/__init__.py	`100.00% <100.00%> (ø)`
art/defences/trainer/adversarial_trainer_awp.py	`100.00% <100.00%> (ø)`

... and 6 files with indirect coverage changes

📢 Have feedback on the report? Share it here.

art/defences/trainer/adversarial_trainer_awp.py

+    def fit_generator(  # pylint: disable=W0221
+        self,
+        generator: DataGenerator,
+        validation_data: Optional[Tuple[np.ndarray, np.ndarray]] = None,
+        nb_epochs: int = 20,
+        **kwargs
+    ):


Signed-off-by: Muhammad Zaid Hameed <Zaid.Hameed@ibm.com>

beat-buesser

Hi @Zaid-Hameed Thank you very much for your pull request! I have added a few minor comments on using properties to avoid pylint warnings. What do you think?

Have you tested your code on GPUs?

beat-buesser · 2023-08-03T11:32:57Z

art/defences/trainer/adversarial_trainer_awp.py

+    from art.utils import CLASSIFIER_LOSS_GRADIENTS_TYPE
+
+
+class AdversarialTrainerAWP(Trainer, abc.ABC):


Is inheriting from abc.ABC required here? Trainer is already inheriting from abc.ABC.

Done by removing abc.ABC.

beat-buesser · 2023-08-03T11:33:57Z

art/defences/trainer/adversarial_trainer_awp.py

+# TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+# SOFTWARE.
+"""
+This module implements adversarial training with AWP protocol.


Let's introduce the abbreviation AWP somewhere.

Suggested change

This module implements adversarial training with AWP protocol.

This module implements adversarial training with Adversarial Weight Perturbation (AWP) protocol.

beat-buesser · 2023-08-03T11:35:28Z