International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 63 - Number 20 |
Year of Publication: 2013 |
Authors: Sunita Jahirabadkar, Parag Kulkarni |
10.5120/10584-5732 |
Sunita Jahirabadkar, Parag Kulkarni . Clustering for High Dimensional Data: Density based Subspace Clustering Algorithms. International Journal of Computer Applications. 63, 20 ( February 2013), 29-35. DOI=10.5120/10584-5732
Finding clusters in high dimensional data is a challenging task as the high dimensional data comprises hundreds of attributes. Subspace clustering is an evolving methodology which, instead of finding clusters in the entire feature space, it aims at finding clusters in various overlapping or non-overlapping subspaces of the high dimensional dataset. Density based subspace clustering algorithms treat clusters as the dense regions compared to noise or border regions. Many momentous density based subspace clustering algorithms exist in the literature. Each of them is characterized by different characteristics caused by different assumptions, input parameters or by the use of different techniques etc. Hence it is quite unfeasible for the future developers to compare all these algorithms using one common scale. In this paper, we presented a review of various density based subspace clustering algorithms together with a comparative chart focusing on their distinguishing characteristics such as overlapping / non-overlapping, axis parallel / arbitrarily oriented and so on.