This example prunes MobileNetV1-SSD, mainly by reducing the number of convolution channels in the backbone network.
The backbone is MobileNetV1, which contains two kinds of convolutions: depthwise convolutions and ordinary 1x1 convolutions. Because of the special structure of depthwise convolutions (each filter is tied to a single input channel), we prune only the ordinary 1x1 convolutions.
First, we need the names of the filter weights of the ordinary 1x1 convolutions in the backbone. In the current implementation, the network is defined in fluid.default_main_program, so the names and shapes of all parameters in the network can be printed as follows:
for param in fluid.default_main_program().global_block().all_parameters():
    print("{}: {}".format(param.name, param.shape))
The code above prints the parameter names and shapes in the order they are defined in the network, as shown below:
conv2d_0.w_0 (32L, 3L, 3L, 3L)
depthwise_conv2d_0.w_0 (32L, 1L, 3L, 3L)
conv2d_1.w_0 (64L, 32L, 1L, 1L)
depthwise_conv2d_1.w_0 (64L, 1L, 3L, 3L)
conv2d_2.w_0 (128L, 64L, 1L, 1L)
depthwise_conv2d_2.w_0 (128L, 1L, 3L, 3L)
conv2d_3.w_0 (128L, 128L, 1L, 1L)
depthwise_conv2d_3.w_0 (128L, 1L, 3L, 3L)
conv2d_4.w_0 (256L, 128L, 1L, 1L)
depthwise_conv2d_4.w_0 (256L, 1L, 3L, 3L)
conv2d_5.w_0 (256L, 256L, 1L, 1L)
depthwise_conv2d_5.w_0 (256L, 1L, 3L, 3L)
conv2d_6.w_0 (512L, 256L, 1L, 1L)
depthwise_conv2d_6.w_0 (512L, 1L, 3L, 3L)
conv2d_7.w_0 (512L, 512L, 1L, 1L)
depthwise_conv2d_7.w_0 (512L, 1L, 3L, 3L)
conv2d_8.w_0 (512L, 512L, 1L, 1L)
depthwise_conv2d_8.w_0 (512L, 1L, 3L, 3L)
conv2d_9.w_0 (512L, 512L, 1L, 1L)
depthwise_conv2d_9.w_0 (512L, 1L, 3L, 3L)
conv2d_10.w_0 (512L, 512L, 1L, 1L)
depthwise_conv2d_10.w_0 (512L, 1L, 3L, 3L)
conv2d_11.w_0 (512L, 512L, 1L, 1L)
depthwise_conv2d_11.w_0 (512L, 1L, 3L, 3L)
conv2d_12.w_0 (1024L, 512L, 1L, 1L)
depthwise_conv2d_12.w_0 (1024L, 1L, 3L, 3L)
From this listing we can see that the ordinary 1x1 convolution weights are named conv2d_1.w_0 through conv2d_12.w_0, which can be expressed with the regular expression:
"conv2d_[1-9].w_0|conv2d_1[0-2].w_0"
Taking uniform pruning as the example, pay particular attention to the following configuration options (a sketch of a config file combining them follows the list):
- target_ratio: the fraction of FLOPs to prune away. When setting it, also consider how large a share of the whole network's parameters the backbone accounts for: if the value is too large, some convolution layers may have all of their channels pruned away. To avoid this, try several values first, observe how each convolution layer gets pruned, and then settle on a suitable value. This example uses 0.2.
- pruned_params: the names of the parameters to prune; regular expressions are supported. Make sure the expression does not match any parameter you do not want pruned. The safest approach is a pattern of the form param_1_name|param_2_name|param_3_name, which strictly matches only the listed names. Based on the first step, this example uses conv2d_[1-9].w_0|conv2d_1[0-2].w_0.
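Putting the two options together, the uniform-pruning part of the compression config (the file passed as args.compress_config in the script below) might look like the following sketch. This assumes the YAML layout of PaddleSlim v1; keys such as pruner_1, start_epoch, epoch, and checkpoint_path are illustrative placeholders and should be checked against the config file actually shipped with the example:

version: 1.0
pruners:
    pruner_1:
        class: 'StructurePruner'
        pruning_axis:
            '*': 0
        criterions:
            '*': 'l1_norm'
strategies:
    uniform_pruning_strategy:
        class: 'UniformPruneStrategy'
        pruner: 'pruner_1'
        start_epoch: 0
        target_ratio: 0.2
        pruned_params: 'conv2d_[1-9].w_0|conv2d_1[0-2].w_0'
compressor:
    epoch: 100
    checkpoint_path: './checkpoints/'
    strategies:
        - uniform_pruning_strategy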
The compression script in this example is modified from ssd/train.py. Note the following points:
PaddleSlim does not yet support fluid.metrics or fluid.evaluator, so metrics.DetectionMAP is rewritten as:
gt_label = fluid.layers.cast(x=gt_label, dtype=gt_box.dtype)
if difficult:
    difficult = fluid.layers.cast(x=difficult, dtype=gt_box.dtype)
    gt_label = fluid.layers.reshape(gt_label, [-1, 1])
    difficult = fluid.layers.reshape(difficult, [-1, 1])
    label = fluid.layers.concat([gt_label, difficult, gt_box], axis=1)
else:
    label = fluid.layers.concat([gt_label, gt_box], axis=1)
map_var = fluid.layers.detection.detection_map(
    nmsed_out,
    label,
    class_num,
    background_label=0,
    overlap_threshold=0.5,
    evaluate_difficult=False,
    ap_version=ap_version)
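The resulting map_var then has to be exposed to the evaluation loop. A minimal sketch, assuming the (name, variable_name) tuple format used by the Compressor fetch lists; val_fetch_list is the list passed as eval_fetch_list in the call below:

# sketch: fetch the mAP variable during evaluation (tuple format assumed)
val_fetch_list = [("map", map_var.name)]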
Note that when constructing the Compressor, both train_reader and eval_reader are given py_reader objects; because a py_reader feeds data by itself, no feed_list is needed.
compressor = Compressor(
    place,
    fluid.global_scope(),
    train_prog,
    train_reader=train_py_reader,  # a py_reader, not a plain data reader
    train_feed_list=None,          # no feed list needed with a py_reader
    train_fetch_list=train_fetch_list,
    eval_program=test_prog,
    eval_reader=test_py_reader,    # a py_reader as well
    eval_feed_list=None,           # no feed list needed with a py_reader
    eval_fetch_list=val_fetch_list,
    train_optimizer=None)
The following code saves the pruned model:
com_pass = Compressor(...)
com_pass.config(args.compress_config)
com_pass.run()
pruned_prog = com_pass.eval_graph.program
fluid.io.save_inference_model("./pruned_model/", [image.name, label.name], [acc_top1], exe, main_program=pruned_prog)
# check the shape of parameters
for param in pruned_prog.global_block().all_parameters():
    print("name: {}; shape: {}".format(param.name, param.shape))
For the save_inference_model API, see: https://www.paddlepaddle.org.cn/documentation/docs/zh/1.5/api_cn/io_cn.html#save-inference-model
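For completeness, the saved model can be loaded back with the counterpart API, fluid.io.load_inference_model. A minimal sketch, using the same directory as the save call above:

import paddle.fluid as fluid

# load the pruned inference model saved above
exe = fluid.Executor(fluid.CPUPlace())
[inference_program, feed_names, fetch_targets] = fluid.io.load_inference_model(
    "./pruned_model/", exe)
print(feed_names)  # expected to contain the image and label variable names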