Abstract
In various situations one is given only the predictions of multiple classifiers over a large un-labeled test data. This scenario raises the following questions: Without any labeled data and without any a-priori knowledge about the reliability of these different classifiers, is it possible to consistently and computationally efficiently estimate their accuracies? Furthermore, also in a completely unsupervised manner, can one construct a more accurate un-supervised ensemble classifier? In this paper, focusing on the binary case, we present simple, computationally efficient algorithms to solve these questions. Furthermore, under standard classifier independence assumptions, we prove our methods are consistent and study their asymptotic error. Our approach is spectral based on the fact that the off-diagonal entries of the classifiers' covari-ance matrix and 3-d tensor are rank-one. We illustrate the competitive performance of our algorithms via extensive experiments on both artificial and real datasets.
Original language | English |
---|---|
Pages (from-to) | 407-415 |
Number of pages | 9 |
Journal | Journal of Machine Learning Research |
Volume | 38 |
State | Published - 2015 |
Externally published | Yes |
Event | 18th International Conference on Artificial Intelligence and Statistics, AISTATS 2015 - San Diego, United States Duration: 9 May 2015 → 12 May 2015 Conference number: 18 https://proceedings.mlr.press/v38 |
Bibliographical note
Publisher Copyright:Copyright 2015 by the authors.