This repo is the official code for "Diversity-Measurable Anomaly Detection", CVPR 2023 by Wenrui Liu, Hong Chang, Bingpeng Ma, Shiguang Shan, Xilin Chen.
- Linux or Win
- Python 3.6
- PyTorch 1.9.0
- TorchVision 0.10.0
- Numpy 1.19.5
- OpenCV 3.4.2
- Scipy 1.5.4
- PIL 8.1.0
If you need to download a dataset, these resources may be helpful:
Then you need to move the downloaded datasets, like ``./dataset/ped2/'' for PDM and specify the corresponding path for PPDM.
Download VQ-CVAE-based DMAD-PDM from GoogleDrive or design your custom dataset with code in DMAD-Toy Then chech the arguments parser and run:
cd DMAD-Toy
python main.py
The master branch does not contain pre-processing files (>25MB, e.g. background template for ShanghaiTech) and you need to download them at: BaiduPan (Password:dmad) or GoogleDrive
All files are located with the same path, you just need to download them to the Corresponding Folder and Unzip them.
To train or test PDM version DMAD framework, just run:
cd DMAD-PDM
python [[Train], [Evaluate]]_[[ped2], [avenue], [shanghai]].py
<*optional> python process_avenue.py # to remove static anomalies, e.g. bag and a sitting person on the left side of the screen in dir_1 and dir_2
To train or test PPDM version DMAD framework, just run:
cd DMAD-PPDM
python [[Train], [Evaluate]]_mvtec.py
We also provide four model checkpoints to reproduce the performance report in the papar at: BaiduPan (Password:dmad) or GoogleDrive
We mainly focuses on the diversity modeling and measurement framework, where geometrical diversity is just one of common pattern in anomaly detection. However, as for anomaly with other kind of diversities, e.g. colors, the geometric-specific diversity measurement may not be positively correlated to anomaly severity. Besides, the training of PDM version may be unstable under certain parameters.
Our solution about stable diversity-agnostic modeling approach will be linked to this repo after relevant paper is accepted.
The architecture for PPDM version is based on the implementation of RD. VQ-Layer and the architecture for toy experiment on MNIST is based on the implementation of VQ-VAE. We also thank the authors of MNAD for the framework of loading data and models applied to surveillance videos.
If you find this code useful, please consider citing our paper:
@inproceedings{liu2023dmad,
title={Diversity-Measurable Anomaly Detection},
author={Wenrui Liu and Hong Chang and Bingpeng Ma and Shiguang Shan and Xilin Chen},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month={June},
year={2023},
pages={12147-12156}
}