International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 157 - Number 6 |
Year of Publication: 2017 |
Authors: Arpit Bansal, Mayur Sharma, Shalini Goel |
10.5120/ijca2017912719 |
Arpit Bansal, Mayur Sharma, Shalini Goel . Improved K-mean Clustering Algorithm for Prediction Analysis using Classification Technique in Data Mining. International Journal of Computer Applications. 157, 6 ( Jan 2017), 35-40. DOI=10.5120/ijca2017912719
Clustering is technique which is used to analyze the data in efficient manner and generate required information. To cluster the dataset, there is a technique named k-mean, is applied which is based on central point selection and calculation of Euclidian Distance. Here in k-mean, dataset will be loaded and from the dataset. Central points are selected using the formulae Euclidian distance and on the basis of Euclidian distance points are assigned to the clusters. The main disadvantage of k-mean is of accuracy, as in k-mean clustering user needs to define number of clusters. Because of user defined number of clusters, some points of the dataset are remained un-clustered. In this work, improvement in the k-mean clustering algorithm will be proposed which can define number of clusters automatically and assign required cluster to un-clustered points. The proposed improvement will leads to improvement in accuracy and reduce clustering time by the member assigned to the cluster to predict cancer.