CFP last date
20 February 2025
Reseach Article

Analysis of Machine Learning Algorithms for prediction and classification of Breast Cancer

by Aakarsh Goel, Abhishek Chauhan, Daksh Pal, Rahul Singh
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 186 - Number 20
Year of Publication: 2024
Authors: Aakarsh Goel, Abhishek Chauhan, Daksh Pal, Rahul Singh
10.5120/ijca2024923632

Aakarsh Goel, Abhishek Chauhan, Daksh Pal, Rahul Singh . Analysis of Machine Learning Algorithms for prediction and classification of Breast Cancer. International Journal of Computer Applications. 186, 20 ( May 2024), 43-48. DOI=10.5120/ijca2024923632

@article{ 10.5120/ijca2024923632,
author = { Aakarsh Goel, Abhishek Chauhan, Daksh Pal, Rahul Singh },
title = { Analysis of Machine Learning Algorithms for prediction and classification of Breast Cancer },
journal = { International Journal of Computer Applications },
issue_date = { May 2024 },
volume = { 186 },
number = { 20 },
month = { May },
year = { 2024 },
issn = { 0975-8887 },
pages = { 43-48 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume186/number20/analysis-of-machine-learning-algorithms-for-prediction-and-classification-of-breast-cancer/ },
doi = { 10.5120/ijca2024923632 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-05-24T23:33:16.169841+05:30
%A Aakarsh Goel
%A Abhishek Chauhan
%A Daksh Pal
%A Rahul Singh
%T Analysis of Machine Learning Algorithms for prediction and classification of Breast Cancer
%J International Journal of Computer Applications
%@ 0975-8887
%V 186
%N 20
%P 43-48
%D 2024
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The most common disease that can be seen in women is breast cancer. According to 2021 statistics it was found that 281,550 new cancer cases were discovered in US. Due to rapid increase in death because to breast cancer, there is a need to find an effective solution to this problem. As we know that ML algorithms helps in providing solution with better accuracy. In this paper we have applied several ML algorithms like DT (Decision tree), RF (Random Forest) Classifier, NB (Naïve Bayes) classifier, KNN, ADABOOST, GBDT, SVM (Support Vector Machine), SGD, RF (Random Forest) Classifier. And we have applied feature selection to extract best attributes so that ML classifier can provide better accuracy to our model and helps in saving life of many peoples. The accuracy of GDBT is 97%, SVM classifier is 96.4% , ADABOOST is 96%, SGD is 94% , RF classifier is 92%, KNN is 90% DT classifier 90% and NB classifier 90% . Out of all GBDT provides best accuracy which is 97%.

References
  1. Abbas S, Jalil Z, Javed AR, Batool I, Khan MZ, Noorwali A, Gadekallu TR, Akbar A. 2021. BCD-WERT: a novel approach for breast cancer detection using whale optimization based efficient features and extremely randomized tree algorithm. PeerJ Computer Science 7:e390 https://doi.org/10.7717/peerj-cs.390
  2. Chugh, G., Kumar, S. & Singh, N. Survey on Machine Learning and Deep Learning Applications in Breast Cancer Diagnosis. Cogn Comput 13, 1451–1470 (2021). https://doi.org/10.1007/s12559-020-09813-6
  3. Aswathy M. A., & Mohan, J. (2020). Analysis of Machine Learning Algorithms for Breast Cancer Detection. In S. Velayutham (Ed.), Handbook of Research on Applications and Implementations of Machine Learning Techniques (pp. 1-20). IGI Global. https://doi.org/10.4018/978-1-5225-9902-9.ch001
  4. Li J, Zhou Z, Dong J, Fu Y, Li Y, Luan Z, et al. (2021) Predicting breast cancer 5-year survival using machine learning: A systematic review. PLoS ONE 16(4): e0250370. https://doi.org/10.1371/ journal.pone.0250370
  5. Deshmukh, P. R., & Phalnikar, R. (2021). Information extraction for prognostic stage prediction from breast cancer medical records using NLP and ML. Medical & Biological Engineering & Computing, 59(9), 1751–1772. doi:10.1007/s11517-021-02399-7
  6. R. MurtiRawat, S. Panchal, V. K. Singh and Y. Panchal, "Breast Cancer Detection Using K-Nearest Neighbors, Logistic Regression and Ensemble Learning," 2020 International Conference on Electronics and Sustainable Communication Systems (ICESC), 2020, pp. 534-540, doi: 10.1109/ICESC48915.2020.9155783.
  7. Jabbar, M. A. . (2021). Breast Cancer Data Classification Using Ensemble Machine Learning. Engineering and Applied Science Research, 48(1), 65–72. Retrieved from https://ph01.tci-thaijo.org/index.php/easr/article/view/234959 (breast cancer is .....correct treatment)
  8. Wu, & Hicks, C. (2021). Breast Cancer Type Classification Using Machine Learning. Journal of Personalized Medicine, 11(2), 61. https://doi.org/10.3390/jpm11020061 (Breast cancer is a ….negative breast cancer types)
  9. Rahman, M. M., Ghasemi, Y., Suley, E., Zhou, Y., Wang, S., & Rogers, J. (2021). Machine Learning Based Computer Aided Diagnosis of Breast Cancer Utilizing Anthropometric and Clinical Features. IRBM, 42(4), 215–226. doi:10.1016/j.irbm.2020.05.005
  10. P. Tang, X. Yang, Y. Nan, S. Xiang and Q. Liang, "Feature Pyramid Nonlocal Network With Transform Modal Ensemble Learning for Breast Tumour Segmentation in Ultrasound Images," in IEEE Transactions on Ultrasonic, Ferroelectrics, and Frequency Control, vol. 68, no. 12, pp. 3549-3559, Dec. 2021, doi: 10.1109/TUFFC.2021.3098308.
  11. Luque L., Villareal–González R., Vásquez L.C. (2021) Integration of Data Mining Classification Techniques and Ensemble Learning for Predicting the Type of Breast Cancer Recurrence. In: Patgiri R., Bandyopadhyay S., Balas V.E. (eds) Proceedings of International Conference on Big Data, Machine Learning and Applications. Lecture Notes in Networks and Systems, vol 180. Springer, Singapore. https://doi.org/10.1007/978-981-33-4788-5_5
  12. Su, Wu, J., Gu, D., Yang, S., Deng, S., & Khakimova, A. K. (2021). An Adaptive Deep Ensemble Learning Method for Dynamic Evolving Diagnostic Task Scenarios. Diagnostics, 11(12), 2288. https://doi.org/10.3390/diagnostics11122288
  13. Chan H-C, Chattopadhyay A, Chuang EY and Lu T-P (2021) Development of a Gene-Based Prediction Model for Recurrence of Colorectal Cancer Using an Ensemble Learning Algorithm. Front. Oncol. 11:631056. doi: 10.3389/fonc.2021.631056
  14. Raj, A. N. J., Nersisson, R., Mahesh, V. G. V., & Zhuang, Z. (2021). Nipple Localization in Automated Whole Breast Ultrasound Coronal Scans Using Ensemble Learning. Ultrasonic Imaging, 43(1), 29–45. https://doi.org/10.1177/0161734620974273
  15. Sohail, A., Khan, A., Nisar, H., Tabassum, S., & Zameer, A. (2021). Mitotic nuclei analysis in breast cancer histopathology images using deep ensemble classifier. Medical Image Analysis, 72, 102121. doi:10.1016/j.media.2021.102121
  16. Saber, M. Sakr, O. M. Abo-Seida, A. Keshk and H. Chen, "A Novel Deep-Learning Model for Automatic Detection and Classification of Breast Cancer Using the Transfer-Learning Technique," in IEEE Access, vol. 9, pp. 71194-71209, 2021, doi: 10.1109/ACCESS.2021.3079204.
  17. Boumaraf, S., Liu, X., Wan, Y., Zheng, Z., Ferkous, C., Ma, X., Li, Z., & Bardou, D. (2021). Conventional Machine Learning versus Deep Learning for Magnification Dependent Histopathological Breast Cancer Image Classification: A Comparative Study with Visual Explanation. Diagnostics, 11(3), 528. https://doi.org/10.3390/diagnostics11030528
  18. Houssein, E. H., Emam, M. M., Ali, A. A., & Suganthan, P. N. (2021). Deep and machine learning techniques for medical imaging-based breast cancer: A comprehensive review. Expert Systems with Applications, 167, 114161. doi:10.1016/j.eswa.2020.114161
  19. Figueroa, J.D., Gierach, G.L., Duggan, M.A. et al. Risk factors for breast cancer development by tumor characteristics among women with benign breast disease. Breast Cancer Res 23, 34 (2021). https://doi.org/10.1186/s13058-021-01410-1
  20. Bychkov, D., Linder, N., Tiulpin, A. et al. Deep learning identifies morphological features in breast cancer predictive of cancer ERBB2 status and trastuzumab treatment efficacy. Sci Rep 11, 4037 (2021). https://doi.org/10.1038/s41598-021-83102-6
  21. Zeleznik, R., Weiss, J., Taron, J. et al. Deep-learning system to improve the quality and efficiency of volumetric heart segmentation for breast cancer. npj Digit. Med. 4, 43 (2021). https://doi.org/10.1038/s41746-021-00416-5
  22. P. T. Dalvi and N. Vernekar, "Anemia detection using ensemble learning techniques and statistical models," 2016 IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT), 2016, pp. 1747-1751, doi: 10.1109/RTEICT.2016.7808133.
  23. Basker N., Theetchenya S., Vidyabharathi D., Dhaynithi J., Mohanraj G., Marimuthu M., Vidhya G. (2021). Breast Cancer Detection Using Machine Learning Algorithms. Annals of the Romanian Society for Cell Biology, 2551–2562. Retrieved from
  24. Islam, M.M., Haque, M.R., Iqbal, H. et al. Breast Cancer Prediction: A Comparative Study Using Machine Learning Techniques. SN COMPUT. SCI. 1, 290 (2020). https://doi.org/10.1007/s42979-020-00305-w
  25. Shler Farhad Khorshid, & Adnan Mohsin Abdulazeez. (2021). BREAST CANCER DIAGNOSIS BASED ON K-NEAREST NEIGHBORS: A REVIEW. PalArch’s Journal of Archaeology of Egypt / Egyptology, 18(4), 1927-1951. Retrieved from https://archives.palarch.nl/index.php/jae/article/view/6601
  26. Turgut, Siyabend; Dagtekin, Mustafa; Ensari, Tolga (2018). [IEEE 2018 Electric Electronics, Computer Science, Biomedical Engineerings' Meeting (EBBT) - Istanbul, Turkey (2018.4.18-2018.4.19)] 2018 Electric Electronics, Computer Science, Biomedical Engineerings' Meeting (EBBT) - Microarray breast cancer data classification using machine learning methods. , (), 1–3. doi:10.1109/EBBT.2018.8391468
Index Terms

Computer Science
Information Sciences

Keywords

Breast Cancer Machine Learning EDA