Fix `ActivationDefense` and `SpectralSignatures` expected flattened bug #2327

f4str · 2023-11-14T21:59:49Z

Description

Modify the ActivateDefense and SpectralSignatures poisoning defenses to flatten the outputs when calling get_activations() on the estimator.

~~Additionally, the hack in HuggingFace classifier to use the inputs post-flattening was removed since the defenses will now do the flattening and work with any arbitrary layer.~~

Fixes #2313

Type of change

Please check all relevant options.

Improvement (non-breaking)
Bug fix (non-breaking)
New feature (non-breaking)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

Testing

Please describe the tests that you ran to verify your changes. Consider listing any relevant details of your test configuration.

Tests for Activation Defense are unchanged
Tests for Spectral Signatures are unchanged

Test Configuration:

OS
Python version
ART version or commit number
TensorFlow / Keras / PyTorch / MXNet version

Checklist

My code follows the style guidelines of this project
I have performed a self-review of my own code
I have commented my code
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes
My changes have been tested using both CPU and GPU devices

Signed-off-by: Farhan Ahmed <Farhan.Ahmed@ibm.com>

codecov-commenter · 2023-11-14T22:04:36Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (a281f62) 76.18% compared to head (fc7cbcb) 84.09%.

❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

Additional details and impacted files

@@              Coverage Diff               @@
##           dev_1.17.0    #2327      +/-   ##
==============================================
+ Coverage       76.18%   84.09%   +7.90%     
==============================================
  Files             327      327              
  Lines           30850    30852       +2     
  Branches         5716     5716              
==============================================
+ Hits            23503    25944    +2441     
+ Misses           5914     3448    -2466     
- Partials         1433     1460      +27

Files	Coverage Δ
art/defences/detector/poison/activation_defence.py	`83.28% <100.00%> (+0.04%)`	⬆️
...nces/detector/poison/spectral_signature_defense.py	`84.72% <100.00%> (+0.21%)`	⬆️

... and 63 files with indirect coverage changes

beat-buesser

Hi @f4str Thank you very much for your pull request! It looks good to me.

This reverts commit 4db7626. Signed-off-by: Farhan Ahmed <Farhan.Ahmed@ibm.com>

f4str · 2023-12-08T00:24:39Z

Due to issues with GitHub actions tests failing, removing the hack in the HuggingFace classifier to use the inputs post-flattening will not be included in this PR. Once the failed unit test is further investigated, the change will be revisited.

f4str added 2 commits November 14, 2023 12:04

flatten activations for poisoning defenses

123af2c

Signed-off-by: Farhan Ahmed <Farhan.Ahmed@ibm.com>

remove huggingface estimator activation hack

4db7626

Signed-off-by: Farhan Ahmed <Farhan.Ahmed@ibm.com>

f4str marked this pull request as ready for review November 15, 2023 18:16

beat-buesser self-requested a review November 30, 2023 15:52

beat-buesser approved these changes Nov 30, 2023

View reviewed changes

beat-buesser self-assigned this Nov 30, 2023

beat-buesser added bug Something isn't working improvement Improve implementation labels Nov 30, 2023

beat-buesser added this to the ART 1.17.0 milestone Nov 30, 2023

beat-buesser linked an issue Nov 30, 2023 that may be closed by this pull request

ActivationDefense and SpectralSignatures expect flattened activations #2313

Closed

beat-buesser and others added 2 commits November 30, 2023 18:18

Merge branch 'dev_1.17.0' into activation-defense-bug

b9f5a4d

Revert "remove huggingface estimator activation hack"

d345786

This reverts commit 4db7626. Signed-off-by: Farhan Ahmed <Farhan.Ahmed@ibm.com>

Merge branch 'dev_1.17.0' into activation-defense-bug

fc7cbcb

beat-buesser merged commit 95c778e into Trusted-AI:dev_1.17.0 Dec 20, 2023
35 checks passed

f4str deleted the activation-defense-bug branch December 21, 2023 15:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix `ActivationDefense` and `SpectralSignatures` expected flattened bug #2327

Fix `ActivationDefense` and `SpectralSignatures` expected flattened bug #2327

f4str commented Nov 14, 2023 •

edited

Loading

codecov-commenter commented Nov 14, 2023 •

edited

Loading

beat-buesser left a comment

f4str commented Dec 8, 2023

Fix ActivationDefense and SpectralSignatures expected flattened bug #2327

Fix ActivationDefense and SpectralSignatures expected flattened bug #2327

Conversation

f4str commented Nov 14, 2023 • edited Loading

Description

Type of change

Testing

Checklist

codecov-commenter commented Nov 14, 2023 • edited Loading

Codecov Report

beat-buesser left a comment

Choose a reason for hiding this comment

f4str commented Dec 8, 2023

Fix `ActivationDefense` and `SpectralSignatures` expected flattened bug #2327

Fix `ActivationDefense` and `SpectralSignatures` expected flattened bug #2327

f4str commented Nov 14, 2023 •

edited

Loading

codecov-commenter commented Nov 14, 2023 •

edited

Loading