An on-line agglomerative clustering method for nonstationary data

Isaac David Guedalia*, Mickey London, Michael Werman

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

52 Scopus citations

Abstract

An on-line agglomerative clustering algorithm for nonstationary data is described. Three issues are addressed. The first regards the temporal aspects of the data. The clustering of stationary data by the proposed al-gorithm is comparable to the other popular algorithms tested (batch and on-line). The second issue addressed is the number of clusters required to represent the data. The algorithm provides an efficient framework to determine the natural number of clusters given the scale of the problem. Finally, the proposed algorithm implicitly minimizes the local distortion, a measure that takes into account clusters with relatively small mass. In contrast, most existing on-line clustering methods assume stationarity of the data. When used to cluster nonstationary data, these methods fail to generate a good representation. Moreover, most current algorithms are computationally intensive when determining the correct number of clusters. These algorithms tend to neglect clusters of small mass due to their minimization of the global distortion (Energy).

Original languageEnglish
Pages (from-to)521-540
Number of pages20
JournalNeural Computation
Volume11
Issue number2
DOIs
StatePublished - 15 Feb 1999

Fingerprint

Dive into the research topics of 'An on-line agglomerative clustering method for nonstationary data'. Together they form a unique fingerprint.

Cite this