Practical Work (WS 2023): Reimplementation of "Adversarial Examples Are Not Bugs, They Are Features"
This work reimplements the experiments from Ilyas et al. [1] within the scope of the practical work in the master's programme Artificial Intelligence at JKU.
- Install the requirements:
  `conda create --name practical_work --file requirements.txt`
- Run the experiments using `scripts/experiment.bat`. Hyperparameters can be adjusted in there; for a list of available hyperparameters, have a look at `src/train.py`.
- Evaluate the models from the experiments in the second step using `02-evaluate.ipynb` or via the command line using `src/test.py` (a minimal sketch of what this step computes follows after this list).
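To make the evaluation step concrete, here is a minimal sketch of measuring clean test accuracy, assuming a trained PyTorch model and the CIFAR-10 test set used by Ilyas et al. [1] (the dataset choice is an assumption); the actual model loading and command-line interface live in `src/test.py` and are not reproduced here.

```python
import torch
from torch.utils.data import DataLoader
from torchvision import datasets, transforms


def clean_accuracy(model, loader, device="cuda"):
    """Fraction of test images that the model classifies correctly."""
    model.eval()
    correct, total = 0, 0
    with torch.no_grad():
        for x, y in loader:
            x, y = x.to(device), y.to(device)
            correct += (model(x).argmax(dim=1) == y).sum().item()
            total += y.numel()
    return correct / total


# CIFAR-10 is assumed here, following the setting of Ilyas et al. [1].
test_set = datasets.CIFAR10("data", train=False, download=True,
                            transform=transforms.ToTensor())
test_loader = DataLoader(test_set, batch_size=256, shuffle=False)
# model = ...  # load a checkpoint produced by scripts/experiment.bat
# print(f"clean accuracy: {clean_accuracy(model, test_loader):.3f}")
```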
The results from Ilyas et al. (2019) [1] could be replicated in the sense that the relative accuracies of the experiments with respect to each other are similar to theirs, which confirms their main findings.
The main differences are:
- The PGD attack in this work does not reach as low an adversarial accuracy for the standard model as in the work of Ilyas et al. For a PGD attack with $\epsilon=0.5$, steps $=7$ and $\alpha=0.1$, we achieve an adversarial accuracy of roughly $10\%$, while Ilyas et al. report $0\%$.
- For creating $\mathcal{D}_{rand}$ and $\mathcal{D}_{det}$ with PGD, $\epsilon$ needed to be larger in this work than in their setting: we used $\epsilon=2$, while they used $\epsilon=0.5$ (see the sketch below).
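For reference, the following is a minimal sketch of the two PGD uses discussed above, assuming a PyTorch image classifier and an $L_2$-constrained attack (the norm constraint is not stated in this README, so that part is an assumption). The untargeted call corresponds to the adversarial-accuracy evaluation with $\epsilon=0.5$, steps $=7$, $\alpha=0.1$; the targeted call corresponds to constructing $\mathcal{D}_{rand}$ and $\mathcal{D}_{det}$ by perturbing each image towards a new label and then relabelling it, as described by Ilyas et al. [1]. Function names and the exact relabelling rule are illustrative and are not taken from `src/train.py`.

```python
import torch
import torch.nn.functional as F


def pgd_l2(model, x, y, eps, alpha, steps, targeted=False):
    """PGD under an L2 constraint (illustrative sketch, model.eval() assumed).

    targeted=False: move x away from its true label y (adversarial accuracy).
    targeted=True:  move x towards the target label y (building D_rand / D_det).
    """
    x_adv = x.clone().detach()
    for _ in range(steps):
        x_adv.requires_grad_(True)
        loss = F.cross_entropy(model(x_adv), y)
        grad, = torch.autograd.grad(loss, x_adv)
        # Normalise the gradient per sample so that alpha is the step length.
        grad_norm = grad.flatten(1).norm(dim=1).clamp_min(1e-12).view(-1, 1, 1, 1)
        step = alpha * grad / grad_norm
        # Targeted: descend the loss towards y; untargeted: ascend it.
        x_adv = x_adv.detach() + (-step if targeted else step)
        # Project the accumulated perturbation back onto the L2 ball of radius eps.
        delta = x_adv - x
        delta_norm = delta.flatten(1).norm(dim=1).clamp_min(1e-12).view(-1, 1, 1, 1)
        delta = delta * (eps / delta_norm).clamp(max=1.0)
        x_adv = (x + delta).clamp(0.0, 1.0)
    return x_adv.detach()


# Adversarial accuracy of the standard model (eps=0.5, steps=7, alpha=0.1):
# x_adv = pgd_l2(model, x, y, eps=0.5, alpha=0.1, steps=7, targeted=False)

# D_rand / D_det: targeted attack towards a new label t, then relabel (x_adv, t).
# D_rand draws t uniformly at random; D_det assigns t deterministically from y,
# e.g. via a fixed permutation such as t = (y + 1) % num_classes (illustrative).
# In this reimplementation eps=2 was required instead of eps=0.5.
# t_rand = torch.randint(0, num_classes, y.shape, device=y.device)
# t_det  = (y + 1) % num_classes
# x_rand = pgd_l2(model, x, t_rand, eps=2.0, alpha=0.1, steps=7, targeted=True)
```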
[1] Andrew Ilyas, Shibani Santurkar, Dimitris Tsipras, Logan Engstrom, Brandon Tran and Aleksander Madry: “Adversarial Examples Are Not Bugs, They Are Features”. In: Advances in Neural Information Processing Systems (NeurIPS). vol. 32. Vancouver, Canada: Curran Associates, Inc., 2019, pp. 125–136.
Technical Report: https://de.overleaf.com/read/rpvzrryckyfc#6adb47