1;3409;0c Constrained K-means Clustering with Background Knowledge

Constrained K-means Clustering with Background Knowledge

18th International Conference on Machine Learning, 2001
Pages: 577-584

ICML

bibtex

Clustering is traditionally viewed as an unsupervised method for data analysis. However, in some cases information about the problem domain is available in addition to the data instances themselves. In this paper, we demonstrate how the popular k-means clustering algorithm can be pro tably modi- ed to make use of this information. In experiments with arti cial constraints on six data sets, we observe improvements in clustering accuracy. We also apply this method to the real-world problem of automatically detecting road lanes from GPS data and observe dramatic increases in performance. 1.