Jun Fu, Jing Liu, Haijie Tian, Yong Li, Yongjun Bao, Zhiwei Fang, and Hanqing Lu
We propose a Dual Attention Network (DANet) to adaptively integrate local features with their global dependencies based on the self-attention mechanism. DANet achieves new state-of-the-art segmentation performance on three challenging scene segmentation datasets: Cityscapes, PASCAL Context, and COCO Stuff-10k.
We train our DANet-101 with only fine annotated data and submit our test results to the official evaluation server.
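At the heart of DANet are two self-attention modules: a position attention module that relates every spatial location to every other, and a channel attention module that does the same across channels. Below is a minimal sketch of a position-attention-style module to illustrate the idea; it assumes an input feature map of shape (B, C, H, W) and follows the standard self-attention formulation, not the exact implementation in this repository.

```python
import torch
import torch.nn as nn

class PositionAttentionSketch(nn.Module):
    """Illustrative spatial self-attention over a (B, C, H, W) feature map.
    A sketch of the idea, not the repository's exact module."""
    def __init__(self, in_channels):
        super().__init__()
        self.query = nn.Conv2d(in_channels, in_channels // 8, kernel_size=1)
        self.key = nn.Conv2d(in_channels, in_channels // 8, kernel_size=1)
        self.value = nn.Conv2d(in_channels, in_channels, kernel_size=1)
        self.gamma = nn.Parameter(torch.zeros(1))  # learnable weight of the attention branch

    def forward(self, x):
        b, c, h, w = x.shape
        q = self.query(x).flatten(2).transpose(1, 2)   # (B, HW, C//8)
        k = self.key(x).flatten(2)                      # (B, C//8, HW)
        attn = torch.softmax(q @ k, dim=-1)             # (B, HW, HW) pairwise spatial affinities
        v = self.value(x).flatten(2)                    # (B, C, HW)
        out = (v @ attn.transpose(1, 2)).view(b, c, h, w)
        return self.gamma * out + x                     # residual fusion with the input features
```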
2. Installation
- Install PyTorch. The code is tested with Python 3.6 and official PyTorch@commit fd25a2a; please install PyTorch from source.
- The code is modified from PyTorch-Encoding.
Clone the repository and set up the environment:

```shell
# create the conda env and build the package
conda create -n torch12 python=3.6
conda activate torch12
git clone https://github.com/junfu1115/DANet.git
cd DANet
python setup.py develop
pip install torchvision
# copy the built extension into the env's encoding package (adjust the path to your own environment)
cp -a build/lib /home/lisali/anaconda3/envs/torch12/lib/python3.6/site-packages/encoding/

# quick test
cd danet
python test.py --help
```

The setup is successful if the argument list is printed.
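As an extra sanity check (assuming the `torch12` environment above is active), you can verify that both PyTorch and the freshly built `encoding` package import cleanly:

```shell
python -c "import torch, encoding; print(torch.__version__)"
```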
3. Dataset
- Download the [Cityscapes](https://www.cityscapes-dataset.com/) dataset and convert the dataset to [19 categories](https://github.com/mcordts/cityscapesScripts/blob/master/cityscapesscripts/helpers/labels.py).
- Please put the dataset in folder `./datasets`.
Step 0: Download the Cityscapes dataset and put it in `DANet/datasets/`.

Step 1: Clone cityscapesScripts into `DANet/datasets/`:

```shell
git clone https://github.com/mcordts/cityscapesScripts.git
```

The folder layout should look like this:

```
DANet/datasets/
├── gtFine
├── leftImg8bit
└── cityscapesScripts
```

Step 2: Use `cityscapesScripts/preparation/createTrainIdLabelImgs.py` to create the training labels. Open `createTrainIdLabelImgs.py` and change `cityscapesPath`, for example `cityscapesPath = "/home/lisali/DANet/datasets"`. Then run the script:

```shell
python cityscapesScripts/preparation/createTrainIdLabelImgs.py
```

This produces images such as `gtFine/**/**_labelTrainIds.png`, which are the ground-truth labels used as input.

Step 3: Create `train_fine.txt` and `val_fine.txt` files listing the image paths, as sketched below.
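A minimal sketch for generating these lists is shown below. It assumes each line should contain the image path and the matching `*_gtFine_labelTrainIds.png` path separated by a tab; check the Cityscapes dataset loader in the `encoding` package and adjust the line format to whatever it actually expects.

```python
import os

def write_split(split, root="datasets"):
    """Write one tab-separated 'image label' line per sample for a Cityscapes split.
    Hypothetical helper; adjust paths/format to match the dataset loader."""
    out_file = os.path.join(root, "{}_fine.txt".format(split))
    img_root = os.path.join(root, "leftImg8bit", split)
    with open(out_file, "w") as f:
        for city in sorted(os.listdir(img_root)):
            for name in sorted(os.listdir(os.path.join(img_root, city))):
                if not name.endswith("_leftImg8bit.png"):
                    continue
                img = os.path.join("leftImg8bit", split, city, name)
                lbl = os.path.join("gtFine", split, city,
                                   name.replace("_leftImg8bit.png", "_gtFine_labelTrainIds.png"))
                f.write(img + "\t" + lbl + "\n")

write_split("train")  # -> datasets/train_fine.txt
write_split("val")    # -> datasets/val_fine.txt
```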
4. Evaluation
- Download trained model [DANet101](https://drive.google.com/open?id=1XmpFEF-tbPH0Rmv4eKRxYJngr3pTbj6p) and put it in folder `./danet/cityscapes/model`
- Evaluation code is in folder `./danet/cityscapes`
- `cd danet`
- For single-scale testing, please run:

```shell
CUDA_VISIBLE_DEVICES=0,1,2,3 python test.py --dataset cityscapes --model danet --resume-dir cityscapes/model --base-size 2048 --crop-size 768 --workers 1 --backbone resnet101 --multi-grid --multi-dilation 4 8 16 --eval
```

- For multi-scale testing, please run:

```shell
CUDA_VISIBLE_DEVICES=0,1,2,3 python test.py --dataset cityscapes --model danet --resume-dir cityscapes/model --base-size 2048 --crop-size 1024 --workers 1 --backbone resnet101 --multi-grid --multi-dilation 4 8 16 --eval --multi-scales
```

- If you want to visualize the results of DANet-101, you can run:

```shell
CUDA_VISIBLE_DEVICES=0,1,2,3 python test.py --dataset cityscapes --model danet --resume-dir cityscapes/model --base-size 2048 --crop-size 768 --workers 1 --backbone resnet101 --multi-grid --multi-dilation 4 8 16
```
Evaluation results:

The expected scores are as follows ('ss' denotes single-scale testing and 'ms' denotes multi-scale testing):

- DANet-101 on the Cityscapes val set (mIoU/pAcc): 79.93/95.97 (ss) and 81.49/96.41 (ms)
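For reference, multi-scale testing generally rescales the input to several sizes, runs the model on each (often with horizontal flipping as well), and averages the class scores after resizing them back to the original resolution. The sketch below illustrates that idea only; it is not the repository's `--multi-scales` implementation, and it assumes the model returns a single logits tensor.

```python
import torch
import torch.nn.functional as F

def multi_scale_predict(model, image, scales=(0.75, 1.0, 1.25), flip=True):
    """Average softmax scores over rescaled (and optionally flipped) inputs.
    Illustrative sketch only; assumes `model(x)` returns one logits tensor."""
    _, _, h, w = image.shape
    total = 0
    for s in scales:
        x = F.interpolate(image, scale_factor=s, mode="bilinear", align_corners=True)
        logits = F.interpolate(model(x), size=(h, w), mode="bilinear", align_corners=True)
        total = total + torch.softmax(logits, dim=1)
        if flip:
            flipped = model(torch.flip(x, dims=[3]))   # mirror the input
            flipped = torch.flip(flipped, dims=[3])    # mirror the prediction back
            flipped = F.interpolate(flipped, size=(h, w), mode="bilinear", align_corners=True)
            total = total + torch.softmax(flipped, dim=1)
    return total.argmax(dim=1)  # per-pixel class indices
```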
5. Training
- Training code is in folder `./danet/cityscapes`
- `cd danet`
- You can reproduce our result by running:

```shell
CUDA_VISIBLE_DEVICES=0,1,2,3 python train.py --dataset cityscapes --model danet --backbone resnet101 --checkname danet101 --base-size 1024 --crop-size 768 --epochs 240 --batch-size 8 --lr 0.003 --workers 2 --multi-grid --multi-dilation 4 8 16
```
Note: we adopt multiple losses at the end of the network for better training.
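The sketch below shows one way such a summed objective can look: a main cross-entropy loss on the fused prediction plus auxiliary losses on the two attention-branch predictions. The output ordering and the auxiliary weight are assumptions for illustration, not the repository's exact values.

```python
import torch.nn.functional as F

def dual_attention_loss(outputs, target, aux_weight=0.5):
    """Main loss on the fused output plus weighted auxiliary losses on the
    attention-branch outputs. Illustrative; ordering and weight are assumptions."""
    fused, pam_out, cam_out = outputs  # assumed ordering of the network's three outputs
    loss = F.cross_entropy(fused, target, ignore_index=255)   # ignore unlabeled pixels (255 in Cityscapes trainIds)
    loss = loss + aux_weight * F.cross_entropy(pam_out, target, ignore_index=255)
    loss = loss + aux_weight * F.cross_entropy(cam_out, target, ignore_index=255)
    return loss
```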
If DANet is useful for your research, please consider citing:
@inproceedings{fu2018dual,
  title={Dual Attention Network for Scene Segmentation},
  author={Fu, Jun and Liu, Jing and Tian, Haijie and Li, Yong and Bao, Yongjun and Fang, Zhiwei and Lu, Hanqing},
  booktitle={The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2019}
}
Thanks to PyTorch-Encoding, especially for the Synchronized BN!