Header menu link for other important links
X
A linear time algorithm for approximate 2-means clustering
Y. Sabharwal,
Published in
2005
Volume: 32
   
Issue: 2
Pages: 159 - 172
Abstract
Matousek [Discrete Comput. Geom. 24 (1) (2000) 61-84] designed an O(nlogn) deterministic algorithm for the approximate 2-means clustering problem for points in fixed dimensional Euclidean space which had left open the possibility of a linear time algorithm. In this paper, we present a simple randomized algorithm to determine an approximate 2-means clustering of a given set of points in fixed dimensional Euclidean space, with constant probability, in linear time. We first approximate the mean of the larger cluster using random sampling. We then show that the problem can be reduced to a set of lines, on which it can be solved by carefully pruning away points of the larger cluster and randomly sampling on the remaining points to obtain an approximate to the mean of the smaller cluster. © 2005 Elsevier B.V.
About the journal
Published in
Open Access
Impact factor
N/A