Abstract
We present a deep layered architecture that generalizes convolutional neural networks (ConvNets). The architecture, called SimNets, is driven by two operators: (i) a similarity function that generalizes inner-product, and (ii) a log-mean-exp function called MEX that generalizes maximum and average. The two operators applied in succession give rise to a standard neuron but in 'feature space'. The feature spaces realized by SimNets depend on the choice of the similarity operator. The simplest setting, which corresponds to a convolution, realizes the feature space of the Exponential kernel, while other settings realize feature spaces of more powerful kernels (Generalized Gaussian, which includes as special cases RBF and Laplacian), or even dynamically learned feature spaces (Generalized Multiple Kernel Learning). As a result, the SimNet contains a higher abstraction level compared to a traditional ConvNet. We argue that enhanced expressiveness is important when the networks are small due to run-time constraints (such as those imposed by mobile applications). Empirical evaluation validates the superior expressiveness of SimNets, showing a significant gain in accuracy over ConvNets when computational resources at run-time are limited. We also show that in large-scale settings, where computational complexity is less of a concern, the additional capacity of SimNets can be controlled with proper regularization, yielding accuracies comparable to state of the art ConvNets.
Original language | English |
---|---|
Title of host publication | Proceedings - 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016 |
Publisher | IEEE Computer Society |
Pages | 4782-4791 |
Number of pages | 10 |
ISBN (Electronic) | 9781467388504 |
DOIs | |
State | Published - 9 Dec 2016 |
Event | 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016 - Las Vegas, United States Duration: 26 Jun 2016 → 1 Jul 2016 |
Publication series
Name | Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition |
---|---|
Volume | 2016-December |
ISSN (Print) | 1063-6919 |
Conference
Conference | 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016 |
---|---|
Country/Territory | United States |
City | Las Vegas |
Period | 26/06/16 → 1/07/16 |
Bibliographical note
Publisher Copyright:© 2016 IEEE.