CFP last date
20 January 2025
Reseach Article

Preprocessing of Streaming Data using Genetic Algorithm

by Ketan Desale, Roshani Ade
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 120 - Number 17
Year of Publication: 2015
Authors: Ketan Desale, Roshani Ade
10.5120/21319-4324

Ketan Desale, Roshani Ade . Preprocessing of Streaming Data using Genetic Algorithm. International Journal of Computer Applications. 120, 17 ( June 2015), 16-19. DOI=10.5120/21319-4324

@article{ 10.5120/21319-4324,
author = { Ketan Desale, Roshani Ade },
title = { Preprocessing of Streaming Data using Genetic Algorithm },
journal = { International Journal of Computer Applications },
issue_date = { June 2015 },
volume = { 120 },
number = { 17 },
month = { June },
year = { 2015 },
issn = { 0975-8887 },
pages = { 16-19 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume120/number17/21319-4324/ },
doi = { 10.5120/21319-4324 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:06:28.406727+05:30
%A Ketan Desale
%A Roshani Ade
%T Preprocessing of Streaming Data using Genetic Algorithm
%J International Journal of Computer Applications
%@ 0975-8887
%V 120
%N 17
%P 16-19
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In today's world data is rapidly and continuously growing and is not constant in nature. There is a problem to deal with such kind of evolving data, as it is impractical to store and process this streaming data. Also, in real world application, the data which is coming is typically noisy, has some missing values, redundant features, and thus very large time is wasted to preprocess that data. The time complexity can reduce by selecting only useful features to build model for classification. The proposed system addresses the issue of adaptive preprocessing for streaming data. Here Genetic algorithm (GA) is used as a search method while selecting the features which will further use in learning model. The proposed system is applied to different stream datasets and is showing significant increment in classification accuracy.

References
  1. Albert Bifet. 2009Adaptive Learning and Mining for Data Streams and Frequent Patterns. Doctoral Thesis, Universitat Politecnica de Catalunya
  2. Roshani Ade 2014 Instance based vs Batch based incremental learning approach for Students Classification. International Journal of Computer Application, Foundation of Computer Science, USA, vol. 106, no. 3
  3. Roshani Ade 2014 Classification of students by using an incremental ensemble of classifiers. 3rd IEEE International Conference On Reliability, Infocom Technologies and optimization, pp. 61-65, ICRITO- 8-10
  4. Indre Zliobaite, Bogdan Gabrys, "Adaptive Preprocessing for Streaming Data", IEEE Trans. Knowledge and Data Engg. , vol. 26, no. 2, pp. 309- 321, Feb. 2014
  5. S Aksoy 2008 Feature Reduction and Selection Department of Computer Engineering, Bilkent University, 2008, CS 551
  6. B. Kavitha, S. Karthikeyan and B. Chitra 2010 Efficient Intrusion Detection with Reduced Dimension Using Data Mining classification Methods and Their Performance Comparison CCIS 70, pp. 96-101
  7. Mouaad KEZIH, Mahmoud TAIBI 2013 "Evaluation Effectiveness of Intrusion Detection System with Reduced Dimension Using Data Mining Classification Tools", 2nd International Conference on Systems and Computer Science (ICSCS) , August 26-27
  8. Wei Li 2004 "Using Genetic Algorithm for Network Intrusion Detection", Proceedings of the United States Department of Energy Cyber Security Grou, Training Conference, Vol. 8, pp. 24-27.
  9. Anup Goyal, Chetan Kumar, "GA-NIDS: A Genetic Algorithm based Network Intrusion Detection System".
  10. Bharat S. Dhak, Shrikant Lade, "An Evolutionary Approach to Intrusion Detection System using Genetic Algorithm", International Journal of Emerging Technology and Advanced Engineering, Volume 2, Issue 12, December 2012.
  11. K. S. Desale, Rohani Ade, "Genetic Algorithm based Feature Selection Approach for Effective Intrusion Detection System", 2015 International Conference on Computer Communication and Informatics (ICCCI - 2015), Jan. 08 10, 2015, Coimbatore, INDIA
  12. NSL-KDD dataset for network-based intrusion detection systems available on http://iscx. info/NSL-KDD
  13. Airlines data available on http://moa. cms. waikato. ac. nz/datasets/
Index Terms

Computer Science
Information Sciences

Keywords

Genetic Algorithm streaming data preprocessing