CFP last date
20 January 2025
Reseach Article

An Intelligent Classifier for Breast Cancer Diagnosis based on K-Means Clustering and Rough Set

by T. Sridevi, A. Murugan
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 85 - Number 11
Year of Publication: 2014
Authors: T. Sridevi, A. Murugan
10.5120/14889-3336

T. Sridevi, A. Murugan . An Intelligent Classifier for Breast Cancer Diagnosis based on K-Means Clustering and Rough Set. International Journal of Computer Applications. 85, 11 ( January 2014), 38-42. DOI=10.5120/14889-3336

@article{ 10.5120/14889-3336,
author = { T. Sridevi, A. Murugan },
title = { An Intelligent Classifier for Breast Cancer Diagnosis based on K-Means Clustering and Rough Set },
journal = { International Journal of Computer Applications },
issue_date = { January 2014 },
volume = { 85 },
number = { 11 },
month = { January },
year = { 2014 },
issn = { 0975-8887 },
pages = { 38-42 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume85/number11/14889-3336/ },
doi = { 10.5120/14889-3336 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:02:12.794760+05:30
%A T. Sridevi
%A A. Murugan
%T An Intelligent Classifier for Breast Cancer Diagnosis based on K-Means Clustering and Rough Set
%J International Journal of Computer Applications
%@ 0975-8887
%V 85
%N 11
%P 38-42
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Feature selection aims to select subset of original features. It removes unrelated, redundant or noisy data from the problem domain. Rough set theory is often applied to feature reduction using the data alone, requiring no additional information and widely used for classification tool in data mining. Clustering, a form of data grouping, groups a set of data such that the intra-cluster similarity is maximized and the inter-cluster similarity is minimized. In this paper, k-means clustering algorithm is applied to partition the given information system and further rough set theory implemented on the data set to generate feature subset. The classification process by means of SVM is performed by using the remaining features. Wisconsin Breast Cancer datasets derived from UCI machine learning database are used for the purpose of testing the proposed hybrid model and the success rate of hybrid model is determined as 99%.

References
  1. Liu H. and Motoda H. , "Feature Selection for Knowledge Discovery and Data Mining ", Kluwer Academic Publisher, 1999.
  2. Jensen R. and Shen Q. ," A Rough Set-Aided System for Sorting WWW Bookmarks", In Zhong N et al. (Eds. ), Web Intelligence: Research and Development, pp. 95-105, 2001.
  3. Dr. DSVGK Kaladhar, Chandana B, and Bharath kumar P. , "Predicting cancer survivability using classification algorithms", IJRRCS, 2(2), pp. 34-343, 2011.
  4. Ohrn A, "Rough sets: A Knowledge Discovery Technique for MultifactorMedical Outcomes", 1999.
  5. Hassanien A. E, Suraj Z, Slezak D, and Lingras P. , "Rough Computing: Theories, Technologies, and Applications", New York: Information Science Reference, 2008.
  6. J. J. Alpigini, J. F. Peters, J. Skowronek, N. Shong (Eds. ): "Rough sets and Current Trends in Computing", Third International Conference, RSCTC 2002. Malvern, PA, USA, October 14-16, 2002. Lecture Notes in Computer Science 2475 Springer 2002, ISBN 3-540-44274-X.
  7. MacQueen, J. B. (1967). "Some Methods for Classification and Analysis of Multivariate Observations. " In Proc. of 5th Berkley Symposium on Mathematical Statistics and Probability, Volume I: Statistics, pp. 281–297.
  8. http://archive. ics. uci. edu/m1/machine-learning-databases/breast-cancer-wisconsin/breast-cancer.
  9. Mohammad Darzi, Ali AsgharLiaei, Mahdi Hosseini, HabibollahAsghari. "Feature Selection for Breast Cancer Diagnosis: A Case-Based Wrapper Approach. " (2011), World academy of Science, Engineering and Technology 77, 2011,pp 1142-1143.
  10. Li-Yeh Chuang, Sheng-Wei Tsai, Cheng-Hong Yang(2011), "Catfish Binary Particle Swarm Optimization for Feature Selection," Proceedings of the international Conference on Machine Learning and Computing IPCSIT vol. 3 (2011)pp 40-44
  11. Chunekar, V. N. ; Ambulgekar, H. P. (2009). "Approach of Neural Network to Diagnose Breast Cancer on Three Different Data Se. " Proceedings Advances in Recent Technologies in Communication and Computing 2009 ARTcom-2009), 27th-28th Oct. , IEEE, Kottayam. pp. : 893-895.
  12. I. Gadaras, L. Mikhailov. "An interpretable fuzzy rule-based classification methodology for medical diagnosis. " Artificial Intelligence in Medicine 47 (1) (2009) 25–41.
  13. J. Abonyi, and F. Szeifert. "Supervised fuzzy clustering for the identification of fuzzy classifiers. " Pattern Recognition Letters, vol. 14 (24), 2195–2207, 2003.
  14. Qinghua Hu, Jinfu Liu, Daren Yu. "Mixed feature selection based on granulation and approximation. "Knowledge-Based System 21, 294-304. 2008.
Index Terms

Computer Science
Information Sciences

Keywords

SVM feature selection rough set clustering classification.