DualConv：轻质深神经网络的双卷积内核

论文标题

DualConv：轻质深神经网络的双卷积内核

DualConv: Dual Convolutional Kernels for Lightweight Deep Neural Networks

论文作者

Zhong, Jiachen, Chen, Junying, Mian, Ajmal

论文摘要

CNN体系结构通常在内存和计算要求上很大，这使得它们对于具有有限的硬件资源的嵌入式系统不可行。我们提出了双重卷积内核（DualConv），用于构建轻质的深神经网络。 DualConv结合了3 $ \ times $ 3和1 $ \ times $ 1卷积内核，以同时处理相同的输入特征映射频道，并利用组卷积技术有效地安排卷积过滤器。 DualConv可用于任何CNN模型，例如VGG-16和Resnet-50用于图像分类，Yolo和R-CNN进行对象检测，或用于语义分割的FCN。在本文中，我们广泛测试了DualConv进行分类，因为这些网络体系结构构成了许多其他任务的骨干。我们还测试了YOLO-V3上的DualConv检测图像。实验结果表明，与我们的结构创新相结合，DualConv显着降低了深神经网络的计算成本和参数数量，而在某些情况下，与原始模型相比，其准确性略高。我们使用DualConv将轻量级MobilenetV2的参数数量进一步降低54％，而CIFAR-100数据集的准确性仅下降0.68％。当参数的数量不是问题时，DualConv在同一数据集上将MobilenetV1的准确性提高了4.11％。此外，DualConv显着提高了Yolo-V3对象检测速度，并在Pascal VOC数据集上提高了4.4％的精度。

CNN architectures are generally heavy on memory and computational requirements which makes them infeasible for embedded systems with limited hardware resources. We propose dual convolutional kernels (DualConv) for constructing lightweight deep neural networks. DualConv combines 3$\times$3 and 1$\times$1 convolutional kernels to process the same input feature map channels simultaneously and exploits the group convolution technique to efficiently arrange convolutional filters. DualConv can be employed in any CNN model such as VGG-16 and ResNet-50 for image classification, YOLO and R-CNN for object detection, or FCN for semantic segmentation. In this paper, we extensively test DualConv for classification since these network architectures form the backbones for many other tasks. We also test DualConv for image detection on YOLO-V3. Experimental results show that, combined with our structural innovations, DualConv significantly reduces the computational cost and number of parameters of deep neural networks while surprisingly achieving slightly higher accuracy than the original models in some cases. We use DualConv to further reduce the number of parameters of the lightweight MobileNetV2 by 54% with only 0.68% drop in accuracy on CIFAR-100 dataset. When the number of parameters is not an issue, DualConv increases the accuracy of MobileNetV1 by 4.11% on the same dataset. Furthermore, DualConv significantly improves the YOLO-V3 object detection speed and improves its accuracy by 4.4% on PASCAL VOC dataset.

下载PDF全文

下载文献需遵守相关版权规定

论文标题