Network Deconvolution

Ye, Chengxi, Evanusa, Matthew, He, Hua, Mitrokhin, Anton, Goldstein, Thomas, Yorke, James A., Fermüller, Cornelia, Aloimonos, Yiannis

May-28-2019–arXiv.org Machine Learning

Convolution is a central operation in Convolutional Neural Networks (CNNs), which applies a kernel or mask to overlapping regions shifted across the image. In this work we show that the underlying kernels are trained with highly correlated data, which leads to co-adaptation of model weights. To address this issue we propose what we call network deconvolution, a procedure that aims to remove pixel-wise and channel-wise correlations before the data is fed into each layer. We show that by removing this correlation we are able to achieve better convergence rates during model training with superior results without the use of batch normalization on the CIFAR-10, CIFAR-100, MNIST, Fashion-MNIST datasets, as well as against reference models from "model zoo" on the ImageNet standard benchmark.

deconvolution, deep learning, neural network, (19 more...)

arXiv.org Machine Learning

May-28-2019

arXiv.org PDF

Add feedback

Country:
- North America > United States > Maryland > Prince George's County > College Park (0.14)

Genre:
- Research Report (0.84)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Neural Networks > Deep Learning (0.67)
  - Statistical Learning (0.95)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found