Research Article

Comparative study of Naïve Bayes Classifier and KNN for Tuberculosis

Published on November 2011 by Hardik Maniya, Mosin I. Hasan, Komal P. Patel
International Conference on Web Services Computing
Foundation of Computer Science USA
ICWSC - Number 1
November 2011
Authors: Hardik Maniya, Mosin I. Hasan, Komal P. Patel

Hardik Maniya, Mosin I. Hasan, Komal P. Patel. Comparative study of Naïve Bayes Classifier and KNN for Tuberculosis. International Conference on Web Services Computing. ICWSC, 1 (November 2011), 21-26.

@article{
author = { Hardik Maniya, Mosin I. Hasan, Komal P. Patel },
title = { Comparative study of Naïve Bayes Classifier and KNN for Tuberculosis },
journal = { International Conference on Web Services Computing },
issue_date = { November 2011 },
volume = { ICWSC },
number = { 1 },
month = { November },
year = { 2011 },
issn = { 0975-8887 },
pages = { 21-26 },
numpages = 6,
url = { /proceedings/icwsc/number1/3972-wsc005/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 International Conference on Web Services Computing
%A Hardik Maniya
%A Mosin I. Hasan
%A Komal P. Patel
%T Comparative study of Naïve Bayes Classifier and KNN for Tuberculosis
%J International Conference on Web Services Computing
%@ 0975-8887
%V ICWSC
%N 1
%P 21-26
%D 2011
%I International Journal of Computer Applications
Abstract

Data mining has long been applied in medicine to predict conditions such as heart disease, lung disease, and various tumors from past patient data. Although the collection of patient data in India is not streamlined, we attempt to predict the most widespread disease in India, tuberculosis. Using data collected from various TB centers, we extract hidden patterns and learn from them in order to diagnose and predict the disease. In this work we compare the naïve Bayes classifier and k-nearest neighbors (KNN), two of the most effective techniques for data classification (especially for medical diagnosis), implemented in C and with the Weka tool respectively, and use them to classify patients into two categories of tuberculosis risk (least probable and most probable). We used 19 symptoms of tuberculosis and collected 154 cases. We achieved nearly 78% accuracy with few false negatives.
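The comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation (the paper reports naïve Bayes in C and KNN in Weka); it is a small Python stand-in that assumes binary symptom vectors, a Bernoulli naïve Bayes with Laplace smoothing, and KNN with Hamming distance and k = 3. The data rows, the four-symptom layout, and all function names are hypothetical and chosen only for illustration; they are not the paper's 19 symptoms or 154 cases.

```python
import math
from collections import Counter

# Hypothetical toy data, invented for illustration only.
# Each row: (binary symptom vector, label), label 1 = "most probable",
# label 0 = "least probable".
TRAIN = [
    ([1, 1, 1, 0], 1), ([1, 1, 0, 1], 1), ([1, 0, 1, 1], 1), ([0, 1, 1, 1], 1),
    ([0, 0, 0, 1], 0), ([0, 0, 1, 0], 0), ([0, 1, 0, 0], 0), ([1, 0, 0, 0], 0),
]
TEST = [([1, 1, 1, 1], 1), ([0, 0, 0, 0], 0)]


def train_naive_bayes(rows):
    """Estimate class priors and per-symptom probabilities (Laplace-smoothed)."""
    class_counts = Counter(label for _, label in rows)
    n_features = len(rows[0][0])
    present = {c: [0] * n_features for c in class_counts}
    for x, label in rows:
        for j, v in enumerate(x):
            present[label][j] += v
    model = {}
    for c, n_c in class_counts.items():
        prior = n_c / len(rows)
        # P(symptom j present | class c), with add-one smoothing
        probs = [(present[c][j] + 1) / (n_c + 2) for j in range(n_features)]
        model[c] = (prior, probs)
    return model


def predict_naive_bayes(model, x):
    """Return the class with the highest log-posterior score."""
    best, best_score = None, -math.inf
    for c, (prior, probs) in model.items():
        score = math.log(prior)
        for v, p in zip(x, probs):
            score += math.log(p if v else 1 - p)
        if score > best_score:
            best, best_score = c, score
    return best


def predict_knn(rows, x, k=3):
    """Majority vote among the k training rows closest in Hamming distance."""
    def hamming(a, b):
        return sum(u != v for u, v in zip(a, b))
    nearest = sorted(rows, key=lambda row: hamming(row[0], x))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


def evaluate(predict, test_rows):
    """Return (accuracy, number of false negatives) for a predict function."""
    correct = false_negatives = 0
    for x, label in test_rows:
        guess = predict(x)
        correct += guess == label
        false_negatives += (label == 1 and guess == 0)
    return correct / len(test_rows), false_negatives


if __name__ == "__main__":
    nb_model = train_naive_bayes(TRAIN)
    print("naive Bayes:", evaluate(lambda x: predict_naive_bayes(nb_model, x), TEST))
    print("KNN (k=3):  ", evaluate(lambda x: predict_knn(TRAIN, x), TEST))
```

Counting false negatives separately from accuracy mirrors the abstract's emphasis: in a screening setting, a patient wrongly placed in the "least probable" category is the costlier error, so the two classifiers are better compared on both numbers than on accuracy alone.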

Index Terms

Computer Science
Information Sciences

Keywords

Data mining, naïve Bayes, KNN, pattern recognition, tuberculosis, machine learning