TY - JOUR
T1 - An on-line agglomerative clustering method for nonstationary data
AU - Guedalia, Isaac David
AU - London, Mickey
AU - Werman, Michael
PY - 1999/2/15
Y1 - 1999/2/15
N2 - An on-line agglomerative clustering algorithm for nonstationary data is described. Three issues are addressed. The first regards the temporal aspects of the data. The clustering of stationary data by the proposed al-gorithm is comparable to the other popular algorithms tested (batch and on-line). The second issue addressed is the number of clusters required to represent the data. The algorithm provides an efficient framework to determine the natural number of clusters given the scale of the problem. Finally, the proposed algorithm implicitly minimizes the local distortion, a measure that takes into account clusters with relatively small mass. In contrast, most existing on-line clustering methods assume stationarity of the data. When used to cluster nonstationary data, these methods fail to generate a good representation. Moreover, most current algorithms are computationally intensive when determining the correct number of clusters. These algorithms tend to neglect clusters of small mass due to their minimization of the global distortion (Energy).
AB - An on-line agglomerative clustering algorithm for nonstationary data is described. Three issues are addressed. The first regards the temporal aspects of the data. The clustering of stationary data by the proposed al-gorithm is comparable to the other popular algorithms tested (batch and on-line). The second issue addressed is the number of clusters required to represent the data. The algorithm provides an efficient framework to determine the natural number of clusters given the scale of the problem. Finally, the proposed algorithm implicitly minimizes the local distortion, a measure that takes into account clusters with relatively small mass. In contrast, most existing on-line clustering methods assume stationarity of the data. When used to cluster nonstationary data, these methods fail to generate a good representation. Moreover, most current algorithms are computationally intensive when determining the correct number of clusters. These algorithms tend to neglect clusters of small mass due to their minimization of the global distortion (Energy).
UR - http://www.scopus.com/inward/record.url?scp=0033556908&partnerID=8YFLogxK
U2 - 10.1162/089976699300016755
DO - 10.1162/089976699300016755
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 9950742
AN - SCOPUS:0033556908
SN - 0899-7667
VL - 11
SP - 521
EP - 540
JO - Neural Computation
JF - Neural Computation
IS - 2
ER -