CFP last date
20 January 2025
Reseach Article

Survey on Swarm Search Feature Selection for Big Data Stream Mining

by S. Meera, B. Rosiline Jeetha
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 158 - Number 1
Year of Publication: 2017
Authors: S. Meera, B. Rosiline Jeetha
10.5120/ijca2017912720

S. Meera, B. Rosiline Jeetha . Survey on Swarm Search Feature Selection for Big Data Stream Mining. International Journal of Computer Applications. 158, 1 ( Jan 2017), 11-16. DOI=10.5120/ijca2017912720

@article{ 10.5120/ijca2017912720,
author = { S. Meera, B. Rosiline Jeetha },
title = { Survey on Swarm Search Feature Selection for Big Data Stream Mining },
journal = { International Journal of Computer Applications },
issue_date = { Jan 2017 },
volume = { 158 },
number = { 1 },
month = { Jan },
year = { 2017 },
issn = { 0975-8887 },
pages = { 11-16 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume158/number1/26871-2017912720/ },
doi = { 10.5120/ijca2017912720 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T00:03:39.062911+05:30
%A S. Meera
%A B. Rosiline Jeetha
%T Survey on Swarm Search Feature Selection for Big Data Stream Mining
%J International Journal of Computer Applications
%@ 0975-8887
%V 158
%N 1
%P 11-16
%D 2017
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Now days, there are more number of corporations are gathering a more number of information, frequently produced incessantly as a series of measures and approaching from different types of positions. Big data defines a knowledge used to record and execute the data set and it has the structured, semi structured and unstructured data that has to be mined for valuable data. On the other hand, mining through the high dimensional data the search space from which an optimal feature subset is determined and it is enhanced in size, guiding to a difficult stipulate in computation. With respect to handle the troubles, the research work is generally based on the high-dimensionality and streaming structure of data feeds in big data, a new inconsequential feature selection methodology that can be used to identify the feature selection methods in the big data. Some of the research work illustrates the different kinds of optimization methods for data stream mining would lead to tremendous changes in big data. This research work is focused on discussing various research methods that focus on finding the efficient feature selection methods which is used to avoid main challenges and produce optimal solutions. The previous methods are described with their advantages and disadvantages, consequently that the additional research works can be focused more. The tentative experiments were on the entire research works in Mat lab simulation surroundings and it is differentiated with everyone to identify the good methodologies beneath the different performance measures.

References
  1. Alelyani, S., Zhao, Z and Liu, H., 2011. “A dilemma in assessing stability of feature selection algorithms”, in IEEE 13th International Conference on High Performance Computing and Communications (HPCC), 701–707.
  2. Minku, L.L., White A.P and X. Yao, 2010. “The impact of Diversity on online ensemble learning in the presence of concept drift”, 22(5):730–742.
  3. Fong and Simon, 2014. “A Scalable data stream mining methodology: stream-based holistic analytics and reasoning in parallel”, Computational and Business Intelligence (ISCBI), 2014 2nd International Symposium.
  4. Ping-Feng Pai and Tai-Chi Chen, 2009. Rough set theory with discriminant analysis in analyzing electricity loads", Expert Systems with Applications 36:8799–880.
  5. Guyon, I and Elisseeff, A., 2003. “An Introduction to Variable and Feature Selection”, Journal of Machine Learning Research, 3: 1157- 1182.
  6. Chakraborty and Basabi, 2014. “Rough fuzzy consistency measure with evolutionary algorithm for attribute reduction”, 2014 IEEE International Conference on Systems, Man, and Cybernetics (SMC).
  7. Bishop, C.M., 2006. Pattern Recognition and Machine Learning, Springer.
  8. Tani, Fauzia Yasmeen, Dewan Md Farid and Mohammad Zahidur Rahman, 2012. “Ensemble of Decision Tree Classifiers for Mining Web Data Streams”, International Journal of Applied Information Systems, 30-36 .
  9. Akioka and Sayaka, 2013. “Task Graphs of Stream Mining Algorithms”.
  10. Yu, Kui, 2014. “Towards scalable and accurate online feature selection for big data”, 2014 IEEE International Conference on Data Mining.
  11. Tekin, Cem, Luca Canzian and Mihaela Van Der Schaar, 2014. “Context-adaptive big data stream mining”, Communication, Control, and Computing (Allerton).
  12. Ruta and Dymitr, 2014, “Robust method of sparse feature selection for multi-label classification with Naive Bayes”, Computer Science and Information Systems (FedCSIS).
  13. Vu and Anh Thu, 2014. “Distributed adaptive model rules for mining big data streams”, Big Data (Big Data).
  14. Fong and Simon, 2014. "A Scalable data stream mining methodology: stream-based holistic analytics and reasoning in parallel”, Computational and Business Intelligence (ISCBI).
  15. Shivani Harde and Vaishali Sahare, 2015. “ACO Swarm Search Feature Selection for Data stream Mining in Big Data”, International Journal of Innovative Research in Computer and Communication Engineering, 3(12).
  16. Wang and Chanpaul, J., 2015. “A novel initialization method for particle swarm optimization-based FCM in big biomedical data”.
  17. Fong, Simon, Raymond Wong and Athanasios V. Vasilakos, 2016. “Accelerated PSO swarm search feature selection for data stream mining big data”, IEEE Transactions on Services Computing, 33-45.
Index Terms

Computer Science
Information Sciences

Keywords

Big Data Feature Selection Particle Swarm Optimization Classification