We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 December 2024
Reseach Article

Article:Entropy Weighting Genetic k-Means Algorithm for Subspace Clustering

by Anil Kumar Tiwari, Lokesh Kumar Sharma, G. Rama Krishna
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 7 - Number 7
Year of Publication: 2010
Authors: Anil Kumar Tiwari, Lokesh Kumar Sharma, G. Rama Krishna
10.5120/1263-1628

Anil Kumar Tiwari, Lokesh Kumar Sharma, G. Rama Krishna . Article:Entropy Weighting Genetic k-Means Algorithm for Subspace Clustering. International Journal of Computer Applications. 7, 7 ( October 2010), 27-30. DOI=10.5120/1263-1628

@article{ 10.5120/1263-1628,
author = { Anil Kumar Tiwari, Lokesh Kumar Sharma, G. Rama Krishna },
title = { Article:Entropy Weighting Genetic k-Means Algorithm for Subspace Clustering },
journal = { International Journal of Computer Applications },
issue_date = { October 2010 },
volume = { 7 },
number = { 7 },
month = { October },
year = { 2010 },
issn = { 0975-8887 },
pages = { 27-30 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume7/number7/1263-1628/ },
doi = { 10.5120/1263-1628 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T19:55:44.252880+05:30
%A Anil Kumar Tiwari
%A Lokesh Kumar Sharma
%A G. Rama Krishna
%T Article:Entropy Weighting Genetic k-Means Algorithm for Subspace Clustering
%J International Journal of Computer Applications
%@ 0975-8887
%V 7
%N 7
%P 27-30
%D 2010
%I Foundation of Computer Science (FCS), NY, USA
Abstract

This paper presents a genetic k-means algorithm for clustering high dimensional objects in subspaces. High dimensional data faces data sparsity problem. In this algorithm, we present the genetic k-means clustering process to calculate a weight for each dimension in each cluster and use the weight values to identify the subsets of important dimensions that categorize different clusters. This is achieved by including the weight entropy in the objective function that is minimized in the k-means clustering process. Further, the use of genetic algorithm ensure for converge to the global optimum. The experiments on UCI data has reported that this algorithm can generate better clustering results than other subspace clustering algorithms.

References
  1. D.S. Modha and W.S. Spangler, Feature weighting in k-Means Clustering,Machine learning, vol. 52, pp.217-237, 2003.
  2. Huan, J.Z., Ng, M.K. Hongqiang Rong, Zichen Li, Automated variable weighting in k-Means type clustering, IEEE Transactions on pattern Analysis and Machine Intelligence, vol. 27 Issue 5, May 2005 pages : 657-668.
  3. J.H. Friedman and J. J. Meulman, Clustering Objects on Subsets on subsets of Attributes, J. Royal Statstical Soc. B., 2002.
  4. K. Krishna and M. N. Murty, Genetic K-Means Algorithm, IEEE Transactions on Systems, Man, and Cybernetics vol. 29, NO. 3, (1999), 433-439.
  5. L. Jing, M. K. Ng and J. Z. Huang , ‘An Entropy weighting k-Means Algorithm for subspace clustering of high dimensional sparse data’, IEEE Transaction on knowledge and Data Engineering Vol 19, No 8, August 2007.
  6. W. Frawley, G.Piatetsky-Shapiro, C. Matheus, Knowledge discovery in databases: an overview, AI Magazine (1992) pp. 213-228.
  7. Y. Lu, S. Lu , F. Fotouhi ,Y. Deng , and S.J. Brown, FGKA: A Fast Genetic K-means Clustering Algorithm, ACM Symposium on Applied Computing ISBN:1-58113-812-1 (2004), 622-623
  8. Y. Lu, S. Lu, F. Fotouhi,Y. Deng and S. J. Brown S. J, Incremental genetic K-means algorithm and its application in gene expression data analysis, BMC Bioinformatics (2004), 5(172).
  9. Z. Yu and H. S. Wong, Genetic based k-means algorithm for selection of feature variables, IEEE ICPR’06, 2006.
  10. S. Hettich and S. D. Bay, The UCI KDD Archive [http://kdd.ics.uci.edu] Invine, CA: University of California, Department of Information and Computer Science.
Index Terms

Computer Science
Information Sciences

Keywords

Genetic Algorithm Clustering Subspace clustering