Abstract
Pre-trained deep neural networks, powerful models trained on large datasets, have become a popular tool in computer vision for transfer learning. However, the standard approach of using a single network potentially misses out on valuable information contained in other readily available models. In this work, we study the Mixture of Experts (MoE) approach for adaptively fusing multiple pre-trained models for each individual input image. In particular, we explore how far we can get by combining diverse pre-trained representations in a customized way that maximizes their potential in a lightweight framework. Our approach is motivated by an empirical study of the predictions made by popular pre-trained nets across various datasets, finding that both performance and agreement between models vary across datasets. We further propose a miniature CNN gating mechanism operating on a thumbnail version of the input image, and show this is enough to guide a good fusion. Finally, we explore a multi-modal blend of visual and natural-language representations, using a label-space embedding to inject pre-trained word-vectors. Across multiple datasets, we demonstrate that an adaptive fusion of pre-trained models can obtain favorable results.
Original language | English |
---|---|
Title of host publication | ICPRAM 2019 - Proceedings of the 8th International Conference on Pattern Recognition Applications and Methods |
Editors | Maria De Marsico, Gabriella Sanniti di Baja, Ana Fred |
Publisher | SciTePress |
Pages | 135-144 |
Number of pages | 10 |
ISBN (Electronic) | 9789897583513 |
DOIs | |
State | Published - 2019 |
Externally published | Yes |
Event | 8th International Conference on Pattern Recognition Applications and Methods, ICPRAM 2019 - Prague, Czech Republic Duration: 19 Feb 2019 → 21 Feb 2019 |
Publication series
Name | ICPRAM 2019 - Proceedings of the 8th International Conference on Pattern Recognition Applications and Methods |
---|
Conference
Conference | 8th International Conference on Pattern Recognition Applications and Methods, ICPRAM 2019 |
---|---|
Country/Territory | Czech Republic |
City | Prague |
Period | 19/02/19 → 21/02/19 |
Bibliographical note
Publisher Copyright:Copyright © 2019 by SCITEPRESS - Science and Technology Publications, Lda. All rights reserved
Keywords
- Deep Learning
- Fusion