MaxD K-Means: A clustering algorithm for auto-generation of centroids and distance of data points in clusters

Beg, Abul Hashem; Fazley Rabbi, Khandakar; Herawan, Tutut; Wan Maseri, Wan Mohd

Publication date: 1 January 2012
Publisher: Springer Science and Business Media LLC

Abstract

K-Means is an unsupervised, partitioning clustering algorithm. It is popular and widely used because it is simple and fast. Its main drawback is that the user must specify the number of clusters in advance. As an iterative clustering strategy, K-Means is also very sensitive to its initial starting conditions. In this paper, we propose a clustering technique called the MaxD K-Means clustering algorithm. MaxD K-Means automatically generates the initial k (the desired number of clusters) without asking for input from the user. MaxD K-Means also uses a novel strategy for setting the initial centroids. The MaxD K-Means experiments were conducted on synthetic data taken from Lloyd's K-Means experiments. The results show that the new algorithm substantially reduces the number of iterations, with an improvement rate of up to 78%.
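To illustrate the two drawbacks the abstract names, the following is a minimal sketch of the standard Lloyd's K-Means loop, not the MaxD K-Means method itself, whose details are not given here. The function name `lloyd_kmeans`, the toy points, and both sets of starting centroids are assumptions made up for the example. The caller must supply the initial centroids (and therefore k), and the function reports how many passes it took to converge.

```python
import math


def lloyd_kmeans(points, init_centroids, max_iter=100):
    """Plain Lloyd's K-Means (illustrative, not MaxD K-Means).

    Returns (centroids, labels, iterations). k is implied by the
    number of initial centroids, which the caller has to choose.
    """
    centroids = [tuple(c) for c in init_centroids]
    labels = None
    for iteration in range(1, max_iter + 1):
        # Assignment step: each point goes to its nearest centroid.
        new_labels = [
            min(range(len(centroids)), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        # Converged once no point changes cluster.
        if new_labels == labels:
            return centroids, labels, iteration
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old centroid if a cluster is empty
                centroids[j] = tuple(sum(x) / len(members) for x in zip(*members))
    return centroids, labels, max_iter


# Hypothetical toy data: two well-separated groups of three points.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (9.0, 8.5), (8.5, 9.0)]

# Start A: one centroid near each group.
_, labels_a, iters_a = lloyd_kmeans(points, [(1.0, 1.0), (9.0, 9.0)])
# Start B: both centroids inside the same group.
_, labels_b, iters_b = lloyd_kmeans(points, [(1.0, 1.0), (1.5, 2.0)])

print(iters_a, iters_b)
```

On this toy data both starts reach the same partition, but the poorly placed start needs an extra pass. This is the kind of iteration count that a better choice of initial centroids aims to reduce.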

Available Versions

UMP Institutional Repository

oai:umpir.ump.edu.my:27004

Last time updated on 16/03/2020