
Modeling Adversarial Noise for Adversarial Training

Paper

The implementation of Modeling Adversarial Noise for Adversarial Training (ICML 2022).

Deep neural networks have been demonstrated to be vulnerable to adversarial noise, prompting the development of defenses against adversarial attacks. Motivated by the fact that adversarial noise contains well-generalizing features, and that the relationship between adversarial data and natural data can help infer natural data and make reliable predictions, in this paper we model adversarial noise by learning the transition relationship between adversarial labels (i.e., the flipped labels used to generate adversarial data) and natural labels (i.e., the ground-truth labels of the natural data). Specifically, we introduce an instance-dependent transition matrix to relate adversarial labels and natural labels, which can be seamlessly embedded with the target model (enabling us to model stronger adaptive adversarial noise). Empirical evaluations demonstrate that our method can effectively improve adversarial accuracy.

Figure: An illustration of our proposed Modeling Adversarial Noise-based method (MAN). $\boldsymbol{y}$ denotes the ground-truth label $y$ in the form of a vector. $\boldsymbol{\hat{y}^{\prime}}$ and $\boldsymbol{\hat{y}}$ denote the probability of the estimated mixture label $\hat{y}^{\prime}$ and the probability of the inferred natural label $\hat{y}$, respectively. That is, $\boldsymbol{\hat{y}^{\prime}}=\delta(h_{\theta}(x^{\prime}))$ and $\boldsymbol{\hat{y}}=\boldsymbol{\hat{y}^{\prime}} \cdot \widehat{T}(x^{\prime};\omega)$.
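To make the pipeline above concrete, here is a minimal PyTorch sketch of the MAN forward pass. The module names, the linear transition head, and the row-wise softmax normalization are illustrative assumptions, not the repository's actual implementation:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MAN(nn.Module):
    """Sketch: infer natural labels from adversarial predictions via an
    instance-dependent transition matrix (names/shapes are assumptions)."""

    def __init__(self, backbone: nn.Module, in_dim: int, num_classes: int):
        super().__init__()
        self.backbone = backbone  # h_theta: maps x' to class logits
        self.num_classes = num_classes
        # T_hat(x'; omega): predicts one num_classes x num_classes matrix per input
        self.transition = nn.Sequential(
            nn.Flatten(),
            nn.Linear(in_dim, num_classes * num_classes),
        )

    def forward(self, x_adv):
        # y_hat' = softmax(h_theta(x')): estimated mixture-label probabilities
        y_mix = F.softmax(self.backbone(x_adv), dim=1)                      # (B, C)
        # Row-wise softmax so each row of T_hat is a conditional distribution
        T = self.transition(x_adv).view(-1, self.num_classes, self.num_classes)
        T = F.softmax(T, dim=2)                                             # (B, C, C)
        # y_hat = y_hat' . T_hat(x'; omega): inferred natural-label probabilities
        y_nat = torch.bmm(y_mix.unsqueeze(1), T).squeeze(1)                 # (B, C)
        return y_mix, y_nat
```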

Requirements

  • This codebase is written for Python 3 and PyTorch.

  • To install the necessary Python packages, run pip install -r requirements.txt.

Experiments

Data

  • Please download the dataset and place it in the 'data' directory.

Training

  • To train the standard adversarial training (AT) model:
python train_standard.py --model_dir 'your checkpoint directory'
  • To train the MAN-based model (a minimal sketch of one training step follows this list):
python train_standard_man.py --model_dir 'your checkpoint directory'
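Below is a minimal sketch of one MAN training step built around standard L∞ PGD. The attack hyperparameters, attacking the backbone alone, and the loss (cross-entropy only on the inferred natural label) are simplifying assumptions; the paper's full objective also supervises the estimated mixture label with the adversarial labels.

```python
import torch
import torch.nn.functional as F

def pgd_attack(model, x, y, eps=8/255, alpha=2/255, steps=10):
    """Standard L-infinity PGD; hyperparameters are common defaults, not
    necessarily those used by the repository's training scripts."""
    x_adv = (x + torch.empty_like(x).uniform_(-eps, eps)).clamp(0, 1).detach()
    for _ in range(steps):
        x_adv.requires_grad_(True)
        loss = F.cross_entropy(model(x_adv), y)
        grad = torch.autograd.grad(loss, x_adv)[0]
        x_adv = x_adv.detach() + alpha * grad.sign()
        # Project back into the eps-ball around x and the valid image range
        x_adv = torch.min(torch.max(x_adv, x - eps), x + eps).clamp(0, 1)
    return x_adv.detach()

def man_training_step(man, optimizer, x, y):
    """One training step (sketch): generate adversarial data against the
    target model, then supervise the inferred natural label with y."""
    x_adv = pgd_attack(man.backbone, x, y)
    _, y_nat = man(x_adv)
    # NLL on log-probabilities; small constant added for numerical stability
    loss = F.nll_loss(torch.log(y_nat + 1e-12), y)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```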

Test

  • To test the learned model using general attacks, including PGD, AutoAttack, the DDN attack, etc. (a robust-accuracy evaluation sketch follows this list):
python test_comparison.py --model_dir 'your checkpoint directory'
  • To test the learned model using adaptive attacks:
python test_adaptive.py --model_dir 'your checkpoint directory'
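For reference, here is a minimal robust-accuracy evaluation loop under PGD, reusing the sketches above. The loader yielding (image, label) batches and the device handling are assumptions:

```python
def robust_accuracy(man, loader, device="cuda"):
    """Sketch: accuracy of the inferred natural labels on PGD adversarial
    examples; reuses the pgd_attack and MAN sketches above."""
    man.eval()
    correct, total = 0, 0
    for x, y in loader:
        x, y = x.to(device), y.to(device)
        x_adv = pgd_attack(man.backbone, x, y)  # attack the target model
        with torch.no_grad():
            _, y_nat = man(x_adv)               # infer natural labels
        correct += (y_nat.argmax(dim=1) == y).sum().item()
        total += y.size(0)
    return correct / total
```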

License and Contributing

  • This README is formatted following the paperswithcode template.
  • Feel free to post issues via GitHub.

Reference

If you find the code useful in your research, please consider citing our paper:

@inproceedings{zhou2022modeling,
  title={Modeling Adversarial Noise for Adversarial Training},
  author={Zhou, Dawei and Wang, Nannan and Han, Bo and Liu, Tongliang},
  booktitle={International Conference on Machine Learning},
  pages={27353--27366},
  year={2022},
  organization={PMLR}
}
