CFP last date
22 December 2025
Call for Paper
January Edition
IJCA solicits high quality original research papers for the upcoming January edition of the journal. The last date of research paper submission is 22 December 2025

Submit your paper
Know more
Random Articles
Reseach Article

Early Detection of Heart Disease using a Random Forest Classifier on the UCI Heart Disease Dataset

by Nawir Khalid Al-Waeli, Renad Hazzaa Al-Masabi, Shroq Ahmad Alhamami, Ahmad M.Alkheder Alamami
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 187 - Number 67
Year of Publication: 2025
Authors: Nawir Khalid Al-Waeli, Renad Hazzaa Al-Masabi, Shroq Ahmad Alhamami, Ahmad M.Alkheder Alamami
10.5120/ijca2025926161

Nawir Khalid Al-Waeli, Renad Hazzaa Al-Masabi, Shroq Ahmad Alhamami, Ahmad M.Alkheder Alamami . Early Detection of Heart Disease using a Random Forest Classifier on the UCI Heart Disease Dataset. International Journal of Computer Applications. 187, 67 ( Dec 2025), 29-33. DOI=10.5120/ijca2025926161

@article{ 10.5120/ijca2025926161,
author = { Nawir Khalid Al-Waeli, Renad Hazzaa Al-Masabi, Shroq Ahmad Alhamami, Ahmad M.Alkheder Alamami },
title = { Early Detection of Heart Disease using a Random Forest Classifier on the UCI Heart Disease Dataset },
journal = { International Journal of Computer Applications },
issue_date = { Dec 2025 },
volume = { 187 },
number = { 67 },
month = { Dec },
year = { 2025 },
issn = { 0975-8887 },
pages = { 29-33 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume187/number67/early-detection-of-heart-disease-using-a-random-forest-classifier-on-the-uci-heart-disease-dataset/ },
doi = { 10.5120/ijca2025926161 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2025-12-18T17:50:29.256169+05:30
%A Nawir Khalid Al-Waeli
%A Renad Hazzaa Al-Masabi
%A Shroq Ahmad Alhamami
%A Ahmad M.Alkheder Alamami
%T Early Detection of Heart Disease using a Random Forest Classifier on the UCI Heart Disease Dataset
%J International Journal of Computer Applications
%@ 0975-8887
%V 187
%N 67
%P 29-33
%D 2025
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The heart disease is still among the top causes of death all over the world, which means that there will be an urgent demand for systems that would help in clinical decision making through accurate and early detection. This research made use of the UCI Heart Disease Dataset, which consists of 1,025 patient records and 13 clinical features[1], to predict the diagnosis of heart disease. The dataset contains both numerical and categorical attributes, like age, sex, type of chest pain, resting blood pressure, cholesterol, ECG, maximum heart rate, exercise-induced angina, ST depression (oldpeak), slope, number of vessels, and thalassemia. All categorical variables were encoded with labels, and the data was split into training and testing sets for reliable evaluation to be possible. Only one single supervised machine learning model, as Random Forest Classifier, was chosen as it had proved to be very strong and effective on clinical data with structure. The model was trained with the optimized parameters (300 estimators, max depth = 10) and very competitive results were obtained: accuracy of 97.66%, precision of 100%, recall of 95.2% and F1-score of 97.54%. The confusion matrix assured the robustness of the predictive power by correctly identifying 251 out of 257 test samples. Thus, the Random Forest method is found to be very accurate for the early detection of heart disease and has the potential to be integrated into real-world medical diagnostic systems.

References
  1. Gangal, K. 2021. Heart Disease Dataset UCI. Kaggle. Available at: https://www.kaggle.com/datasets/ketangangal/heart-disease-dataset-uci
  2. Detrano, R., Janosi, A., Steinbrunn, W., Pfisterer, M., Schmid, J. J., Sandhu, S., Guppy, K. H., Lee, S., and Froelicher, V. 1989. International application of a new probability algorithm for the diagnosis of coronary artery disease. American Journal of Cardiology, 64(5), 304–310. https://doi.org/10.1016/0002-9149(89)90524-9
  3. Dua, D. and Graff, C. 2019. UCI Machine Learning Repository: Heart Disease Data Set. University of California, Irvine, School of Information and Computer Sciences. Available at: https://archive.ics.uci.edu/ml/datasets/heart+Disease
  4. Tiwari, A. and Jain, A. 2020. Heart disease prediction using machine learning algorithms. International Journal of Engineering Research & Technology (IJERT), 9(7), 612–617. Available at: https://www.ijert.org/heart-disease-prediction-using-machine-learning-algorithms
  5. Haq, A. U., Li, J. P., Memon, M. H., Nazir, S., Ahmad, S., Sun, R., and Wang, L. 2018. Intelligent machine learning approach for effective recognition of heart disease.IEEE Access, 7, 34938–34945. https://doi.org/10.1109/ACCESS.2019.2905157
  6. Uddin, S., Khan, A., Hossain, M. E., and Moni, M. A. 2019. Comparing different supervised machine learning algorithms for disease prediction. BMC Medical Informatics and Decision Making, 19(1), 1–16. https://doi.org/10.1186/s12911-019-1004-8
  7. Abdullah, A., and Rajalaxmi, R. R. 2012. Predicting coronary heart disease using Random Forest classifier. International Journal of Computer Applications (IJCA). Availableat:https://www.ijcaonline.org/proceedings/icon3c/number3/6020-1021
  8. Aziz, M. B., and Rizvi, S. W. A. 2025. Comparative analysis of machine learning algorithms for heart disease prediction. International Journal of Computer Applications, 187(5), 62–65. DOI: https://doi.org/10.5120/ijca2025924890
  9. Nasution, N., Hasan, M. A., and Nasution, F. B. 2025. Predicting heart disease using machine learning models on the UCI dataset. IT Journal Research and Development. DOI: https://doi.org/10.25299/itjrd.2025.17941
  10. Abdullah, M. 2024. AI-based framework for early detection of heart disease using enhanced neural models. Frontiers in Artificial Intelligence, 7, 1539588. DOI: https://doi.org/10.3389/frai.2024.1539588
  11. Shaikh, M. S., Patidar, P. K., Vaghela, B. A., Pandwal, A. N., and Ali, S. I. 2025. Heart disease risk assessment using Random Forest algorithm. AIP Conference Proceedings, 3343, 030007. DOI: https://doi.org/10.1063/5.0292473
  12. Chulde-Fernández, B., et al. 2025. Classification of heart failure using machine learning: a comparative study. Life, 15(3), 496. DOI: https://doi.org/10.3390/life15030496
Index Terms

Computer Science
Information Sciences

Keywords

Heart Disease Prediction Machine Learning Random Forest Classifier UCI Heart Disease Dataset Clinical Data Analysis Early Diagnosis Classification Models Medical Data Mining Evaluation Metrics Accuracy Precision Recall F1-Score