ICST Conference on Heterogeneous Networking for Quality, Reliability, Security and Robustness |
Foundation of Computer Science USA |
QSHINE - Number 1 |
December 2013 |
Authors: Sadhana Tiwari, Tanu Solanki |
b319dc1f-a569-457a-aa35-cb2f35ab3854 |
Sadhana Tiwari, Tanu Solanki . An Optimized Approach for k-means Clustering. ICST Conference on Heterogeneous Networking for Quality, Reliability, Security and Robustness. QSHINE, 1 (December 2013), 5-7.
Cluster analysis method is one of the most analytical methods of data mining. The method will directly influence the result of clustering. This paper discusses the standard of k-mean clustering and analyzes the shortcomings of standard k-means such as k-means algorithm calculates distance of each data point from each cluster centre. Calculating this distance in each iteration makes the algorithm of low efficiency. This paper introduces an optimized algorithm which solves this problem. This is done by introducing a simple data structure to store some information in every iteration and used this information in next iteration. The introduced algorithm does not require calculating the distance of each data point from each cluster centre in each iteration due to which running time of algorithm is saved. Experimental results show that the improved algorithm can efficiently improve the speed of clustering and accuracy by reducing the computational complexity of standard k-means algorithm.