CFP last date
20 January 2025
Reseach Article

An Ensemble Approach to Classifier Construction based on Bootstrap Aggregation

by Dewan Md. Farid, Mohammad Zahidur Rahman, Chowdhury Mofizur Rahman
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 25 - Number 5
Year of Publication: 2011
Authors: Dewan Md. Farid, Mohammad Zahidur Rahman, Chowdhury Mofizur Rahman
10.5120/3027-4098

Dewan Md. Farid, Mohammad Zahidur Rahman, Chowdhury Mofizur Rahman . An Ensemble Approach to Classifier Construction based on Bootstrap Aggregation. International Journal of Computer Applications. 25, 5 ( July 2011), 30-34. DOI=10.5120/3027-4098

@article{ 10.5120/3027-4098,
author = { Dewan Md. Farid, Mohammad Zahidur Rahman, Chowdhury Mofizur Rahman },
title = { An Ensemble Approach to Classifier Construction based on Bootstrap Aggregation },
journal = { International Journal of Computer Applications },
issue_date = { July 2011 },
volume = { 25 },
number = { 5 },
month = { July },
year = { 2011 },
issn = { 0975-8887 },
pages = { 30-34 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume25/number5/3027-4098/ },
doi = { 10.5120/3027-4098 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:10:58.803214+05:30
%A Dewan Md. Farid
%A Mohammad Zahidur Rahman
%A Chowdhury Mofizur Rahman
%T An Ensemble Approach to Classifier Construction based on Bootstrap Aggregation
%J International Journal of Computer Applications
%@ 0975-8887
%V 25
%N 5
%P 30-34
%D 2011
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In this paper, we introduce a new approach to the classification of streaming data based on bootstrap aggregation (bagging). The proposed approach creates an ensemble model by using ID3 classifier, naïve Bayesian classifier, and k-Nearest-Neighbor classifier for a learning scheme where each classifier gives the weighted prediction. ID3, naïve Bayesian, and k-Nearest-Neighbor classifiers are very well known data mining methods, which have been already used in many real life classification problems. The proposed approach addresses the practical problems of the classification of streaming data and successfully tested on a number of benchmark problems including large intrusion detection dataset from the UCI machine learning repository to produce a comparison with the established approaches. The experimental results demonstrate that the proposed ensemble classifier achieved high classification rates and generated very low misclassification error.

References
  1. L. Breiman, “Bagging predictors,” Machine Learning, Vol. 24, 1996, pp. 123-140.
  2. B. Efron, and R. Tibshirani, “An Introduction to the Bootstrap,” Chapman & Hall, 1993.
  3. R. Kohavi, “A study of cross-validation and bootstrap for accuracy estimation and model selection,” In Proc. of the 14th Joint International Conference Artificial Intelligence (IJCAI’95), Vol. 2, August 1995, Montreal, Canada, pp. 1137-1143.
  4. U. Fayyad, G. Piatetsky-Shapiro, and P. Smyth, “From data mining to knowledge discovery: An overview,” In Advances in Knowledge Discovery and Data Mining, MIT Press, 1996.
  5. S. Pang, S. Ozawa, and N. Kasabov, “Incremental linear discriminant analysis for classification of data streams,” IEEE Trans. Syst. Man Cybern. B, Cybern., Vol. 35, No. 5, Oct. 2005, pp. 905-914.
  6. R. Jin, and G. Agrawal, “Efficient decision tree construction on streaming data,” in Proc. ACM SIGKDD, 2003, pp. 571-576.
  7. L. Breiman, J. Friedman, C. Stone, and R. Olshen, “Classification and Regression Trees,” Boca Raton, Fl: Chapman & Hall, 1993.
  8. K. M. A. Chai, H. T. Ng, and H. L. Chieu, “Bayesian online classifiers for text classification and filtering,” in Proc. SIGIR 2002, Tampere, Finland, Aug. 11-15, pp. 97-104.
  9. C. M. Bishop, “Neural Networks for Pattern Recognition,” Oxford, U.K.: Oxford University Press, 1995.
  10. The Archive UCI Machine Learning Datasets. http://archive.ics.uci.edu/ml/datasets/
  11. J. R. Quinlan, “Induction of Decision Tree,” Machine Learning Vol. 1, pp. 81-106, 1986.
  12. P. Langley, “Induction of recursive Bayesian classifier,” In Proc. of the European Conference on Machine Learning, 1993, pp. 153-164.
  13. P. Langely, W. Iba, and K. Thomas, “An analysis of Bayesian classifier,” In Proc. of the 10th National Conference on Artificial Intelligence, San Mateo, CA: AAAI Press, 1992, pp. 223-228.
  14. Z. Zheng, and I. G. Webb, “Lazy learning of Bayesian roles,” Machine Learning-1, Kluwer Academic Publishers, Boston, 2000, pp. 1-35.
  15. B. V. Dasarathy, “Nearest Neighor (NN) Norms: NN Pattern Classification Techniques,” IEEE Computer Society Press, 1991,
  16. Duda, R., P.E. Hart, and D.G. Stork, “Pattern classification,” Second edn. John Wiley & Sons, 2001.
  17. The KDD Archive. KDD99 cup dataset, 1999. http://kdd.ics.uci.edu/databases/kddcup99/kddcup99.html.
Index Terms

Computer Science
Information Sciences

Keywords

Bagging ID3 Classifier Naïve Bayesian Classifier k-Nearest-Neighbor Classifier Classification Rate