International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 89 - Number 7 |
Year of Publication: 2014 |
Authors: M. Ramakrishnan, D. Tennyson Jyaraj |
10.5120/15518-4102 |
M. Ramakrishnan, D. Tennyson Jyaraj . Modified K-Means Algorithm for Effective Clustering of Categorical Data Sets. International Journal of Computer Applications. 89, 7 ( March 2014), 39-42. DOI=10.5120/15518-4102
Traditional k-means algorithm is well known for its clustering ability and efficiency on large amount of data sets. But this method is well suited for numeric values only and cannot be effectively used for categorical data sets. In this paper, we present modified k-means algorithms that can that can perform clustering very effectively on mixed data sets. The main intuition behind our proposed method is that all prototypes are the potential candidates at the root level. For the children of the root node, we can prune the candidate set by using simple geometrical constraints. The experimental results show that this method is well suited for categorical data sets and overall time of computation is very minimal.