TY - JOUR
T1 - DO-Conv
T2 - Depthwise Over-Parameterized Convolutional Layer
AU - Cao, Jinming
AU - Li, Yangyan
AU - Sun, Mingchao
AU - Chen, Ying
AU - Lischinski, Dani
AU - Cohen-Or, Daniel
AU - Chen, Baoquan
AU - Tu, Changhe
N1 - Publisher Copyright:
© 1992-2012 IEEE.
PY - 2022
Y1 - 2022
N2 - Convolutional layers are the core building blocks of Convolutional Neural Networks (CNNs). In this paper, we propose to augment a convolutional layer with an additional depthwise convolution, where each input channel is convolved with a different 2D kernel. The composition of the two convolutions constitutes an over-parameterization, since it adds learnable parameters, while the resulting linear operation can be expressed by a single convolution layer. We refer to this depthwise over-parameterized convolutional layer as DO-Conv, which is a novel way of over-parameterization. We show with extensive experiments that the mere replacement of conventional convolutional layers with DO-Conv layers boosts the performance of CNNs on many classical vision tasks, such as image classification, detection, and segmentation. Moreover, in the inference phase, the depthwise convolution is folded into the conventional convolution, reducing the computation to be exactly equivalent to that of a convolutional layer without over-parameterization. As DO-Conv introduces performance gains without incurring any computational complexity increase for inference, we advocate it as an alternative to the conventional convolutional layer. We open sourced an implementation of DO-Conv in Tensorflow, PyTorch and GluonCV at https://github.com/yangyanli/DO-Conv.
AB - Convolutional layers are the core building blocks of Convolutional Neural Networks (CNNs). In this paper, we propose to augment a convolutional layer with an additional depthwise convolution, where each input channel is convolved with a different 2D kernel. The composition of the two convolutions constitutes an over-parameterization, since it adds learnable parameters, while the resulting linear operation can be expressed by a single convolution layer. We refer to this depthwise over-parameterized convolutional layer as DO-Conv, which is a novel way of over-parameterization. We show with extensive experiments that the mere replacement of conventional convolutional layers with DO-Conv layers boosts the performance of CNNs on many classical vision tasks, such as image classification, detection, and segmentation. Moreover, in the inference phase, the depthwise convolution is folded into the conventional convolution, reducing the computation to be exactly equivalent to that of a convolutional layer without over-parameterization. As DO-Conv introduces performance gains without incurring any computational complexity increase for inference, we advocate it as an alternative to the conventional convolutional layer. We open sourced an implementation of DO-Conv in Tensorflow, PyTorch and GluonCV at https://github.com/yangyanli/DO-Conv.
KW - Over-parameterization
KW - convolutional Layer
KW - depthwise convolution
UR - http://www.scopus.com/inward/record.url?scp=85130476809&partnerID=8YFLogxK
U2 - 10.1109/TIP.2022.3175432
DO - 10.1109/TIP.2022.3175432
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 35594231
AN - SCOPUS:85130476809
SN - 1057-7149
VL - 31
SP - 3726
EP - 3736
JO - IEEE Transactions on Image Processing
JF - IEEE Transactions on Image Processing
ER -