CFP last date
20 January 2025
Reseach Article

Classification and Regression Analysis of the Prognostic Breast Cancer using Generation Optimizing Algorithms

by Rafaqat Alam Khan, Nasir Ahmad, Nasru Minallah
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 68 - Number 25
Year of Publication: 2013
Authors: Rafaqat Alam Khan, Nasir Ahmad, Nasru Minallah
10.5120/11754-7423

Rafaqat Alam Khan, Nasir Ahmad, Nasru Minallah . Classification and Regression Analysis of the Prognostic Breast Cancer using Generation Optimizing Algorithms. International Journal of Computer Applications. 68, 25 ( April 2013), 42-47. DOI=10.5120/11754-7423

@article{ 10.5120/11754-7423,
author = { Rafaqat Alam Khan, Nasir Ahmad, Nasru Minallah },
title = { Classification and Regression Analysis of the Prognostic Breast Cancer using Generation Optimizing Algorithms },
journal = { International Journal of Computer Applications },
issue_date = { April 2013 },
volume = { 68 },
number = { 25 },
month = { April },
year = { 2013 },
issn = { 0975-8887 },
pages = { 42-47 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume68/number25/11754-7423/ },
doi = { 10.5120/11754-7423 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:28:54.603300+05:30
%A Rafaqat Alam Khan
%A Nasir Ahmad
%A Nasru Minallah
%T Classification and Regression Analysis of the Prognostic Breast Cancer using Generation Optimizing Algorithms
%J International Journal of Computer Applications
%@ 0975-8887
%V 68
%N 25
%P 42-47
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Breast cancer is one of the main causes of female fatality all over the world and is the major field of research since quite a long time with lesser improvement than expected. Many institutions and organizations are working in this field to lead to a possible solution of the problem or to lead to more understanding of the problem. Many previous researches were studied for better understanding of the problem and the work done already to remove redundancy and contribute to the field, Wisconsin-Madison prognostic Breast cancer (WPBC) data set from the UCI machine learning repository was used for training of 198 individual cases by selecting best features out of 34 predictors. Feature selection algorithms were used with machine learning algorithms for feature reduction and for better classification. Different feature selection and generation algorithms were used to improve the accuracy of classification. Many improvements in accuracies were found out by using different approaches than the earlier studies conducted in the same field. The Naïve Bayes and Logistic Regression algorithms showed 8. 28-12. 32% and 0. 82-1. 52% accuracy via 10 fold cross validation analysis improvement accordingly by using different feature selection and generation algorithms with these classifiers and gave better result than the best results known for these classification algorithms.

References
  1. Shomona Gracia Jacob and R. Geetha Ramani. 2012 "Efficient Classifier for Classification of Prognostic Breast Cancer Data through Data Mining Techniques"
  2. J. D. Malley, J. Kruppa, A. Dasgupta, K. G. Malley and A. Ziegler. 2012 Probability Machines; Consistent Probability Estimation Using Non Parametric Learning Machines, Methods Inf Med. 2012; 51:74–81. doi: 10. 3414/ME00-01-0052,2012.
  3. William H. Wolberg, Olvi Mangasarian, UCI Machine Learning Repository [http://archive. ics. uci. edu/ml]. Irvine, CA.
  4. Abraham Karplus. 2012 "Machine Learning Algorithms for Cancer Diagnosis, Santa Cruz County Science Fair".
  5. Wisconsin-Madison prognostic Breast cancer Repositoryftp:ftp. cs. wisc. edu/math-prog/cpo-dataset/machine-learn/cancer/WPBC/WPBC. dat
  6. W. H. Wolberg, W. N. Street, D. M. Heisey, and O. L. Mangasarian. 1995 "Computerized breast cancer Diagnosis and prognosis from fine needle aspirates". Archives of Surgery 1995; 130:511-516
  7. Mierswa, Ingo and Wurst, Michael and Klinkenberg, Ralf and Scholz, Martin and Euler, Timm: YALE: Rapid Prototyping for Complex Data Mining Tasks, in Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-06), 2006.
  8. Sarkar M, Leong TY. 2000 "Application of K-nearest neighbor's algorithm on breast cancer diagnosis problem". Proc AMIA Symp. 2000:759–763.
  9. Gouda I. Salama1, M. B. Abdelhalim2, and Magdy Abd-elghanyZeid3. 2012 "Breast Cancer Diagnosis on Three Different Datasets Using Multi-Classifiers" International Journal of Computer and Information Technology (2277 – 0764) Volume 01– Issue 01, September 2012
  10. William H. Wolberg, W. Nick Street, Dennis M. Heisey and Olvi L. Mangasarian. 1995 "Computer-Derived Nuclear Features Distinguish Malignant from Benign Breast Cytology" volume 26, No 7.
  11. D. Lavanya, Dr. K. Usha Rani. 2011 "ANALYSIS OF FEATURE SELECTION WITH CLASSFICATION: BREAST CANCER DATASETS" ISSN: 0976-5166 Vol. 2 No. 5 Oct-Nov 2011
  12. W. H. Wolberg, W. N. Street, and O. L. Mangasarian, Image analysis and machine learning applied to Breast cancer diagnosis and prognosis, Analytical and Quantitative Cytology and Histology, Vol. 17, No. 2, pages 77-87, April 1995.
  13. W. H. Wolberg, W. N. Street, D. M. Heisey, and O. L. Mangasarian. 1995 "Computer-derived nuclear ``grade'' and breast cancer prognosis, Analytical and Quantitative Cytology and Histology", Vol. 17, Pages 257-264, 1995.
  14. Canadian Cancer Society's Steering Committee on Cancer Statistics. Canadian Cancer Statistics 2012. Toronto, ON: Canadian Cancer Society; 2012. May 2012 ISSN 0835-2976.
Index Terms

Computer Science
Information Sciences

Keywords

Naïve Bayes Feature Selection Logistic