CFP last date
20 January 2025
Reseach Article

An Efficient Approach to Enhance Classifier and Cluster Ensembles Using Genetic algorithms for Mining Drifting Data Streams

by AnutoshPratap Singh, Varsha Sharma, JitendraAgrawal
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 44 - Number 21
Year of Publication: 2012
Authors: AnutoshPratap Singh, Varsha Sharma, JitendraAgrawal
10.5120/6392-8853

AnutoshPratap Singh, Varsha Sharma, JitendraAgrawal . An Efficient Approach to Enhance Classifier and Cluster Ensembles Using Genetic algorithms for Mining Drifting Data Streams. International Journal of Computer Applications. 44, 21 ( April 2012), 1-5. DOI=10.5120/6392-8853

@article{ 10.5120/6392-8853,
author = { AnutoshPratap Singh, Varsha Sharma, JitendraAgrawal },
title = { An Efficient Approach to Enhance Classifier and Cluster Ensembles Using Genetic algorithms for Mining Drifting Data Streams },
journal = { International Journal of Computer Applications },
issue_date = { April 2012 },
volume = { 44 },
number = { 21 },
month = { April },
year = { 2012 },
issn = { 0975-8887 },
pages = { 1-5 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume44/number21/6392-8853/ },
doi = { 10.5120/6392-8853 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:36:07.951993+05:30
%A AnutoshPratap Singh
%A Varsha Sharma
%A JitendraAgrawal
%T An Efficient Approach to Enhance Classifier and Cluster Ensembles Using Genetic algorithms for Mining Drifting Data Streams
%J International Journal of Computer Applications
%@ 0975-8887
%V 44
%N 21
%P 1-5
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Mining data streams is concerned with extracting knowledge structures represented in models and patterns in high-speed streams of information. It raises new problems for the data mining community in terms of how to mine continuous high-speed data items that you can only have one look. The increasing focus of applications that generate and receive data streams stimulates the need for online data stream analysis tools. Recently, mining data streams with concept drifts has become an important and challenging task for a wide range of applications such as target marketing, network intrusion detection, credit card fraud protection, etc. Clustering and classification ensemble learning is a frequently used tool for building prediction models from data streams, due to its fundamental nature of managing large volumes of stream data. These both are the tools which help in improving the performance of mining systems. In order to improve the accuracy and error rate of traditional ensemble models, we propose a new ensemble model which combines both classifiers and clusters together and utilizes genetic algorithms for mining data streams. The main reason for using genetic algorithms along with clustering and classification is its high ability to solve optimization.

References
  1. BPeng Zhang, XingquanZhu ,Jianlong Tan , Li Guo. 2007. Classifier and Cluster Ensembles for Mining Concept Drifting Data Streams. QCIS Centre, Faculty of Eng. & IT, Univ. of Technology.
  2. Reza Ghaemi, Nasir bin Sulaiman, Hamidah Ibrahim, Norwati Mustapha. 2010. A review: accuracy optimization in clustering ensembles using genetic algorithms Springer Science+Business Media B. V.
  3. Mohamed MedhatGaber, ArkadyZaslavsky and ShonaliKrishnaswamy. Mining Data Streams: A Review. Centre for Distributed Systems and Software Engineering, Monash University, 900 Dandenong Rd, Caulfield East, VIC3145, Australia.
  4. C. Shafer, R. Agrawal, and M. Mehta. 1996 Sprint: A scalable parallel classifier for data mining. In VLDB.
  5. J. Gehrke, R. Ramakrishnan, and V. Ganti. RainForest. 1998. A framework for fast decision tree construction of large datasets. In VLDB.
  6. J. Gehrke, V. Ganti, R. Ramakrishnan, and W. Loh. 1999. BOAT– optimistic decision tree construction. In SIGMOD.
  7. P. E. Utgoff. 1989. Incremental induction of decision trees Machine Learning, 4:161–186.
  8. P. Domingos and G. Hulten. 2000. Mining high-speed data streams. In SIGKDD, pages 71–80, Boston, MA, ACM Press.
  9. H. Wang, W. Fan, P. Yu, J. Han. 2003. Mining concept-drifting data streams using ensemble classifiers,In Proc. of KDD.
  10. J. Gao, W. Fan, J. Han. On appropriate assumptions to mine data streams: analysis and practice, 2003 In Proc. of ICDM.
  11. S. W. Mahfoud. 1993. Simple analytical models of genetic algorithms for multimodal function optimization. Proc. of 5th ICGA.
  12. S. W. Mahfoud. 1993. Cross over interaction among niches. Proc. of 1st IEEE Conference on Evolutionary Computation, world Congress on Computation Intelligence, 188-193, 1993. 88-193.
  13. P. K. Nanda, D. P. Muni, P. Kanungo. 2002. Parallelized crowding scheme using a new interconnectionmodel. International Conference on Fuzzy Systems, Calcutta, LNAI(2275), Springer-Verlag:436-443.
Index Terms

Computer Science
Information Sciences

Keywords

Classifier And Cluster Ensembles Using Genetic Algorithms genetic Algorithms Based Cluster And Classifier Ensembles For Mining Drifting Data Streams Mining Data Streams