International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 186 - Number 5
Year of Publication: 2024
Authors: Erfin Nur Rohma Khakim, Erik Iman Heri Ujianto |
DOI: 10.5120/ijca2024923386
Erfin Nur Rohma Khakim, Erik Iman Heri Ujianto. Comparative Analysis of Classification Algorithms for Citizens Welfare Status using PCA as Feature Selection. International Journal of Computer Applications. 186, 5 (Jan 2024), 30-37. DOI=10.5120/ijca2024923386
The government has launched various programs to improve citizens' welfare and reduce poverty. A key obstacle in poverty alleviation lies in the underlying databases. Classifying welfare levels conventionally, by estimation, produces invalid results. In addition, many poor people who should be target recipients of poverty alleviation programs have yet to be recorded. This study proposes a machine learning (data mining) approach to classifying citizens' welfare so that the resulting welfare categories are more computable and valid. The proposed algorithms are Naïve Bayes, Decision Tree, and K-Nearest Neighbor (K-NN), with Principal Component Analysis (PCA) as feature selection and normalization applied during preprocessing. The data used in this research are the Data Indikator Kesejahteraan Sosial (IKS), collected from residents of Bantul Regency in 2022. The IKS data currently consist of 95,347 rows and 27 attributes. The dataset has four class labels: very poor, poor, nearly poor, and not poor. The test results show that, overall, the best-performing algorithm is K-NN, with accuracy, precision, and recall of 96.71%, 95.16%, and 88.79%, respectively. In this study, PCA and normalization also had a significant effect on improving the performance of the classification algorithms. Because the data have large dimensions, future research could apply deep learning algorithms to the classification task.
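The three reported metrics can be illustrated with a minimal sketch for a four-class problem. The abstract does not state how precision and recall were averaged across classes, so macro averaging is assumed here. The `evaluate` helper and the toy labels are hypothetical and are not drawn from the IKS data.

```python
from collections import Counter

# Class labels named in the abstract.
LABELS = ["very poor", "poor", "nearly poor", "not poor"]


def evaluate(y_true, y_pred, labels=LABELS):
    """Return accuracy plus macro-averaged precision and recall.

    Macro averaging is an assumption; the abstract does not say
    which averaging scheme the authors used.
    """
    # Count each (true label, predicted label) pair; missing pairs read as 0.
    pairs = Counter(zip(y_true, y_pred))
    correct = sum(pairs[(c, c)] for c in labels)

    precisions, recalls = [], []
    for c in labels:
        tp = pairs[(c, c)]
        predicted_c = sum(pairs[(t, c)] for t in labels)  # column total
        actual_c = sum(pairs[(c, p)] for p in labels)     # row total
        precisions.append(tp / predicted_c if predicted_c else 0.0)
        recalls.append(tp / actual_c if actual_c else 0.0)

    return {
        "accuracy": correct / len(y_true),
        "precision": sum(precisions) / len(labels),
        "recall": sum(recalls) / len(labels),
    }


# Made-up labels for illustration only (not IKS records).
y_true = ["very poor", "very poor", "poor", "poor",
          "nearly poor", "nearly poor", "not poor", "not poor"]
y_pred = ["very poor", "poor", "poor", "poor",
          "nearly poor", "not poor", "not poor", "not poor"]

scores = evaluate(y_true, y_pred)
print(scores)
```

On this toy input, accuracy and macro recall are both 0.75, while macro precision is about 0.83. Precision and recall can therefore diverge even at the same accuracy, as they do in the reported K-NN figures.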