forked from sg-nm/cgp-cnn-PyTorch
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
PyTorch version
- Loading branch information
Showing
1 changed file
with
38 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,2 +1,40 @@ | ||
# cgp-cnn-PyTorch | ||
A Genetic Programming Approach to Designing CNN Architectures, In GECCO 2017 (oral presentation, Best Paper Award) | ||
|
||
# Designing Convolutional Neural Network Architectures Based on Cartegian Genetic Programming | ||
|
||
This repository contains the code for the following paper: | ||
|
||
Masanori Suganuma, Shinichi Shirakawa, and Tomoharu Nagao, "A Genetic Programming Approach to Designing Convolutional Neural Network Architectures," | ||
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO '17, Best paper award), pp. 497-504 (2017) [[paper]](https://doi.org/10.1145/3071178.3071229) [[arXiv]](https://arxiv.org/abs/1704.00764) | ||
|
||
## Requirement | ||
We use the [PyTorch](https://pytorch.org/) framework for neural networks and tested on the following environment: | ||
|
||
* PyTorch version 0.2.0_4 | ||
* Python version 3.6.2 | ||
* CUDA version 8.0 | ||
* Ubuntu 14.04 LTS | ||
|
||
## Usage | ||
|
||
### Run the architecture search | ||
This code can reproduce the experiment for CIFAR-10 dataset with the same setting of the GECCO 2017 paper (by default scenario). The (training) data are split into the training and validation data. The validation data are used for assigning the fitness to the generated architectures. We use the maximum validation accuracy in the last 10 epochs as the fitness value. | ||
|
||
When you use the multiple GPUs, please specify the `-g` option: | ||
|
||
```shell | ||
python exp_main.py -g 2 | ||
``` | ||
|
||
After the execution, the files, `network_info.pickle` and `log_cgp.txt` will be generated. The file `network_info.pickle` contains the information for Cartegian genetic programming (CGP) and `log_cgp.txt` contains the log of the optimization and discovered CNN architecture's genotype lists. | ||
|
||
Some parameters (e.g., # rows and columns of CGP, and # epochs) can easily change by modifying the arguments in the script `exp_main.py`. | ||
|
||
### Re-training | ||
|
||
The discovered architecture is re-trained by the different training scheme (500 epoch training with momentum SGD) to polish up the network parameters. All training data are used for re-training, and the accuracy for the test data set is reported. | ||
|
||
```shell | ||
python exp_main.py -m retrain | ||
``` |