CFP last date
22 September 2025
Call for Paper
October Edition
IJCA solicits high quality original research papers for the upcoming October edition of the journal. The last date of research paper submission is 22 September 2025

Submit your paper
Know more
Random Articles
Reseach Article

Diagnosing Coronary Artery Disease with Perfect Machine Learning Methods and SHAP Explainability

by Hardeep Kaur, Sandeep Kaur Dhanda, Simarjot Kaur
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 187 - Number 33
Year of Publication: 2025
Authors: Hardeep Kaur, Sandeep Kaur Dhanda, Simarjot Kaur
10.5120/ijca2025925606

Hardeep Kaur, Sandeep Kaur Dhanda, Simarjot Kaur . Diagnosing Coronary Artery Disease with Perfect Machine Learning Methods and SHAP Explainability. International Journal of Computer Applications. 187, 33 ( Aug 2025), 37-44. DOI=10.5120/ijca2025925606

@article{ 10.5120/ijca2025925606,
author = { Hardeep Kaur, Sandeep Kaur Dhanda, Simarjot Kaur },
title = { Diagnosing Coronary Artery Disease with Perfect Machine Learning Methods and SHAP Explainability },
journal = { International Journal of Computer Applications },
issue_date = { Aug 2025 },
volume = { 187 },
number = { 33 },
month = { Aug },
year = { 2025 },
issn = { 0975-8887 },
pages = { 37-44 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume187/number33/diagnosing-coronary-artery-disease-with-perfect-machine-learning-methods-and-shap-explainability/ },
doi = { 10.5120/ijca2025925606 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2025-08-20T21:35:37.044846+05:30
%A Hardeep Kaur
%A Sandeep Kaur Dhanda
%A Simarjot Kaur
%T Diagnosing Coronary Artery Disease with Perfect Machine Learning Methods and SHAP Explainability
%J International Journal of Computer Applications
%@ 0975-8887
%V 187
%N 33
%P 37-44
%D 2025
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Diagnosis of Coronary Artery Disease (CAD)-one of the highest non-communicable diseases that are affecting millions of people in every corner of the world-requires suitable diagnostic instruments that are effective, precise, and easy to understand. The study proposes an early detection machine learning pipeline for CAD based on the Z-Alizadeh Sani dataset. This pipeline consists of domain-specific preprocessing, SMOTE-based class balancing, hybrid feature selection using RFECV and RFE trimming, and evaluation using several classifiers. XGBoost outperformed all models that were employed with a sensitivity of 0.9637 and ROC AUC of 0.9503. To increase the safety of the categorization, a clinically tuned threshold was used. SHAP analysis revealed the key variables to model openness, such as common chest pain, EFTTE, DM, and BMI. The suggested method stands superior with its diagnostic sensitivity and interpretability when compared with existing norms for clinical applicability.

References
  1. World Health Organization. 2021. Cardiovascular diseases (CVDs). Available at: https://www.who.int
  2. Yusuf, S., Hawken, S., Ôunpuu, S., Dans, T., Avezum, A., Lanas, F., et al. 2004. Effect of potentially modifiable risk factors associated with myocardial infarction in 52 countries. The Lancet, 364(9438), 937–952.
  3. Tavakol, M., Ashraf, S., and Brener, S. J. 2012. Coronary angiography: Background, complications and limitations. Heart, 98(12), 915–921.
  4. Heo, R., Nakazato, R., Kalra, D., and Min, J. K. 2014. Noninvasive imaging in coronary artery disease. Seminars in Nuclear Medicine, 44(5), 398–409.
  5. Rajpurkar, P., Irvin, J., Chen, E., Zhu, K., and Ng, A. 2023. AI in healthcare: The path forward. NEJM AI, 1(1), 3–13.
  6. Abdar, M., Zomorodi-Moghadam, M., Das, R., and Ting, I. H. 2019. A new machine learning-based classification for diagnosis of coronary artery disease. Computer Methods and Programs in Biomedicine, 179, 104992.
  7. Naji, M. S., Hashim, S. Z. M., and Salih, M. M. 2024. Artificial neural network model for predicting coronary artery disease using LASSO-selected features. International Journal of Computing and Digital Systems, 13(1), 33–41.
  8. Garavand, A., Behmanesh, A., and Samadbeik, M. 2022. Efficient model for coronary artery disease diagnosis: A comparative study. Journal of Healthcare Engineering, Article ID 5597723.
  9. Bilal, M., Khan, M. J., Raza, B., and Alzahrani, M. 2023. Comparison of machine learning algorithms for coronary artery disease detection using feature selection. Indonesian Journal of Electrical Engineering and Computer Science, 31(1), 187–193.
  10. Muhammad, L. J., Algehyne, E. A., Usman, S. S., and Dada, E. G. 2021. Supervised machine learning models for coronary artery disease prediction: Performance analysis and comparison. Applied Sciences, 11(6), 2874.
  11. Kataria, V., and Kumar, N. 2025. Predicting coronary artery disease using explainable machine learning. International Journal of Scientific Research in Engineering and Management, 9(1), 1–6.
  12. Wang, J., Xue, Q., Zhang, C. W. J., and Tang, X. 2024. Explainable coronary artery disease prediction model based on AutoGluon from AutoML framework. Frontiers in Cardiovascular Medicine, 11, 1360548.
  13. Amini, M., Pursamimi, M., and Hajianfar, G. 2023. Machine learning-based diagnosis and risk classification of coronary artery disease using myocardial perfusion imaging SPECT: A radiomics study. Scientific Reports, 13, 14920.
  14. Refaeilzadeh, P., Tang, L., and Liu, H. 2009. Cross-validation. In Encyclopedia of Database Systems. Springer.
  15. Chawla, N. V., Bowyer, K. W., Hall, L. O., and Kegelmeyer, W. P. 2002. SMOTE: Synthetic minority over-sampling technique. Journal of Artificial Intelligence Research, 16, 321–357.
  16. Douzas, G., and Bacao, F. 2018. Effective data generation for imbalanced learning. Expert Systems with Applications, 91, 464–471.
  17. Hosmer, D. W., and Lemeshow, S. 2000. Applied Logistic Regression. 2nd ed., Wiley.
  18. Cortes, C., and Vapnik, V. 1995. Support-vector networks. Machine Learning, 20, 273–297.
  19. Breiman, L. 2001. Random forests. Machine Learning, 45(1), 5–32.
  20. Friedman, J. H. 2001. Greedy function approximation: A gradient boosting machine. Annals of Statistics, 29(5), 1189–1232.
  21. Chen, T., and Guestrin, C. 2016. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD, 785–794.
  22. Ke, G., Meng, Q., Finley, T., Wang, T., Chen, W., Ma, W., Ye, Q., and Liu, T.-Y. 2017. LightGBM: A highly efficient gradient boosting decision tree. Advances in Neural Information Processing Systems, 30.
  23. Hornik, K., Stinchcombe, M., and White, H. 1989. Multilayer feedforward networks are universal approximators. Neural Networks, 2(5), 359–366.
  24. Bergstra, J., and Bengio, Y. 2012. Random search for hyper-parameter optimization. Journal of Machine Learning Research, 13, 281–305.
  25. Fawcett, T. 2006. An introduction to ROC analysis. Pattern Recognition Letters, 27(8), 861–874.
  26. Esteva, A., Kuprel, B., Novoa, R. A., Ko, J., Swetter, S. M., Blau, H. M., and Thrun, S. 2019. A guide to deep learning in healthcare. Nature Medicine, 25(1), 24–29.
  27. Lundberg, S. M., and Lee, S.-I. 2017. A unified approach to interpreting model predictions. Advances in Neural Information Processing Systems, 30.
Index Terms

Computer Science
Information Sciences

Keywords

Coronary Artery Disease Machine Learning SHAP XGBoost Feature Selection SMOTE Medical Diagnosis