International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 43 - Number 2 |
Year of Publication: 2012 |
Authors: Sovan Kumar Patnaik, Soumya Sahoo, Dillip Kumar Swain |
10.5120/6072-7456 |
Sovan Kumar Patnaik, Soumya Sahoo, Dillip Kumar Swain . Clustering of Categorical Data by Assigning Rank through Statistical Approach. International Journal of Computer Applications. 43, 2 ( April 2012), 1-3. DOI=10.5120/6072-7456
Most of the earlier work on clustering has mainly been focused on numerical data whose inherent geometric properties can be exploited to naturally define distance functions between data points. Working only on numeric values prohibits it from being used to cluster real world data containing categorical values. Recently, the problem of clustering categorical data has started drawing interest. The k-means algorithm is well known for its efficiency in this respect. It is also well known for its efficiency in clustering large data sets. However, in this paper we use the k-means algorithm to categorical domains by assigning rank value to the attributes