International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 184 - Number 19
Year of Publication: 2022
Authors: M. Hemalatha
DOI: 10.5120/ijca2022922209
M. Hemalatha. Diabetic Patient’s Data Classification and Prediction using Machine Learning Ensemble Algorithm. International Journal of Computer Applications. 184, 19 (Jun 2022), 14-18. DOI=10.5120/ijca2022922209
In this research paper, the diabetic patient data is taken from the Pima Indians diabetes dataset. The data is explored and visualized using Pearson correlation statistics. In this dataset, 65% of the records are non-diabetic and 35% are diabetic. After pre-processing, machine learning algorithms are applied to the dataset. The ensemble algorithm (Random Forest) achieves better performance metrics than the other algorithms evaluated: it outperforms the MLP (Multi-Layer Perceptron) classifier, the Support Vector Machine (SVM) classifier, and LR (Logistic Regression). The performance metrics of the machine learning algorithms are calculated from the confusion matrix.
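The two calculations named in the abstract, Pearson correlation and confusion-matrix metrics, can be illustrated with a minimal sketch. It is not the paper's implementation: the function names (`pearson`, `confusion_counts`, `metrics_from_counts`) and the toy labels are hypothetical and chosen only for illustration, the label 1 is assumed to mean diabetic, and the specific metrics shown (accuracy, precision, recall, F1) are the usual ones derived from a confusion matrix, which the abstract does not enumerate.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def confusion_counts(y_true, y_pred):
    """Return (tn, fp, fn, tp) for binary labels; 1 is assumed to mean diabetic."""
    tn = fp = fn = tp = 0
    for t, p in zip(y_true, y_pred):
        if t == 1 and p == 1:
            tp += 1
        elif t == 0 and p == 0:
            tn += 1
        elif t == 0 and p == 1:
            fp += 1
        else:
            fn += 1
    return tn, fp, fn, tp


def metrics_from_counts(tn, fp, fn, tp):
    """Derive accuracy, precision, recall and F1 from confusion-matrix counts."""
    total = tn + fp + fn + tp
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Toy labels invented for illustration; these are NOT results from the paper.
y_true = [0, 0, 0, 0, 0, 0, 1, 1, 1, 1]
y_pred = [0, 0, 0, 0, 0, 1, 1, 1, 1, 0]

counts = confusion_counts(y_true, y_pred)
scores = metrics_from_counts(*counts)
print("tn, fp, fn, tp =", counts)
print(scores)
```

In practice each classifier would produce its own `y_pred` on a held-out test split, and the resulting scores would be compared side by side.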