3.1 Light-Head R-CNN思考

作者: 深度学习模型优化 | 来源:发表于2019-04-28 10:29 被阅读0次

Light-Head R-CNN实在R-FCN的基础上进行改进的。
该方法使用的网络是Two-stage网络。

1 核心思想

Light-Head R-CNN的改进思路是轻量化stage 2的head部分，提高推理速度。

图1 Faster R-CNN

Faster R-CNN的细节可以看我的简书3 计算机视觉-阅读笔记（2）。

图2 R-FCN

R-FCN是在Faster R-CNN基础上发展而来的，核心改进是用卷积层代替了全连接层。其源代码可以参考R-FCN。

图3 Light-head R-CNN

Light-Head R-CNN是在R-FCN的基础上改进而来的，核心思想是在R-FCN的heavy head改为light head，从而提高计算速度。主要体现在两点：

这里的large separable convolution用 $1\times k$ 和 $k \times 1$ 的串行卷积替换了 $k \times k$ 的卷积。

2 代码实现

实际训练中，使用了如下设置：

synchronized SGD with weight decay: 0.0001,momentum: 0.9.
Each mini-batch has 2 images per GPU.
Each image has 2000/1000 RoIs for training/testing.
Pad images within mini-batch to the same size by filling zeros into the right-bottom of the image.
Learning rate: 0.01 for first 1.5M iterations and 0.001 for later 0.5M iterations.
Adopt atrous algorithm in stage 5 of Resnet.
Adopt OHEM.
Backbone network is initialized based on the pre-trained ImageNet.
Pooling size: 7.
Batch normalization is also fixed for faster experiment.
Data augmentation: Horizontal image flipping.

为了平衡精度与速度：

图4 准确率比较

图5 性能比较

本文首先介绍Light-Head R-CNN的设计思想，并且介绍了其改进特点，然后对训练过程中的一些细节进行描述，最后给出了性能分析。

本文标题：3.1 Light-Head R-CNN思考

本文链接：https://www.haomeiwen.com/subject/ifxfnqtx.html