CFP last date
20 January 2025
Reseach Article

Model based Data Imputation

by Vittanala Sai Bhushan, P. Krishna Subba Rao
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 184 - Number 6
Year of Publication: 2022
Authors: Vittanala Sai Bhushan, P. Krishna Subba Rao
10.5120/ijca2022921894

Vittanala Sai Bhushan, P. Krishna Subba Rao . Model based Data Imputation. International Journal of Computer Applications. 184, 6 ( Apr 2022), 1-4. DOI=10.5120/ijca2022921894

@article{ 10.5120/ijca2022921894,
author = { Vittanala Sai Bhushan, P. Krishna Subba Rao },
title = { Model based Data Imputation },
journal = { International Journal of Computer Applications },
issue_date = { Apr 2022 },
volume = { 184 },
number = { 6 },
month = { Apr },
year = { 2022 },
issn = { 0975-8887 },
pages = { 1-4 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume184/number6/32330-2022921894/ },
doi = { 10.5120/ijca2022921894 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T01:20:43.969931+05:30
%A Vittanala Sai Bhushan
%A P. Krishna Subba Rao
%T Model based Data Imputation
%J International Journal of Computer Applications
%@ 0975-8887
%V 184
%N 6
%P 1-4
%D 2022
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Missing or incomplete data is a significant problem in all types of statistical analyses. In this project, multiple imputations using chained equation (MICE) is modified to work with various regression algorithms such as linear regression algorithm. The modified MICE algorithm then will be compared using accuracy on three different datasets.

References
  1. D. Schunk, "A Markov chain Monte Carlo algorithm for multiple imputation in large surveys", AStA Advances in Statistical Analysis, vol. 92, no. 1, pp. 101-114,2008.
  2. A.Ryder et al., "The Advantage of Imputation of Missing Income Da ta to Evaluate the Association Between Income and Self-Reported Health Status (SRH) in a Mexican American Cohort Study", Journal ofImmigrant
  3. M. Azur, E. Stuart, C. Frangakis and P. Leaf, "Multiple imputation by chained equations: what is it and how does it work?", International Journal of Methods in Psychiatric Research, vol. 20, no. 1, pp. 40-49,2011.
  4. C. Padgett, C. Skilbeck and M. Summers, "Missing Data: The Importance and Impact of Missing Data from Clinical Research", Brain Impairment, vol. 15, no. 1, pp. 1-9,2014.
  5. J. Pang, Y. Gu, J. Xu, Y. Bao, and G. Yu, "Efficient graph similarity join with scalable prefix-filtering using mapreduce", In Web-Age Information Management, Springer, pages 415 - 418,2014.
  6. X. Zhao, C. Xiao, W. Zhang, X. Lin, and J. Tang, "Improving Performance of Graph Similarity Joins Using Selected Substructures", Springer International Publishing, Cham, pages 156 -172,2014.
  7. Audigier, F. Husson and J. Josse, "MIMCA: multiple imputation for categorical variables with multiple correspondence analysis", Statistics and Computing, vol. 27, no. 2, pp. 501-518,2016.
  8. J. Jakobsen, C. Gluud, J. Wetterslev and P. Winkel, "When and how should multiple imputation be used for handling missing data in randomised clinical trials – a practical guide with flowcharts", BMC Medical Research Methodology, vol. 17, no. 1,2017.
  9. Jerez, J., Molina, I., García-Laencina, P., Alba, E., Ribelles, N., Martín, M. and Franco, L., 2010. Missing data imputation using statistical and machine learning methods in a real breast cancer problem. Artificial Intelligence in Medicine, 50(2),pp.105-115.
  10. MANSOURIAN, M. and AFSHARI SAFAVI, A., 2017. Handling Missing Data in Questionnaire-Based Studies: A Comparison Between Simple and Imputation Techniques. TurkiyeKlinikleri Journal ofBiostatistics.
  11. Khan, S. and Hoque, A., 2020. SICE: an improved missing data imputation technique. Journal of Big Data, 7(1).
  12. "Imputation(statistics)",En.wikipedia.org,2020. [Online].Available:https://en.wikipedia.org/wiki/Imputati on_(statistics). [Accessed: 03- Sep-2020].
  13. Numpyninja.com,2021.[Online].Available: https://www.numpyninja.com/post/mice-algorithm-to- impute-missing-values-in-a-dataset. [Accessed: 30- Mar- 2021].
Index Terms

Computer Science
Information Sciences

Keywords

Data Imputation MICE Machine learning Multiple Imputation Random Forest.