Survey on Swarm Search Feature Selection for Big Data Stream Mining

S. Meera; B. Rosiline Jeetha

Call for Paper

October Edition

IJCA solicits high quality original research papers for the upcoming October edition of the journal. The last date of research paper submission is 22 September 2025

Submit your paper

Know more

The week's pick

Real-Time Video Transmission using Gaussian Minimum Shift Keying (GMSK) on GNU Radio and USRP for Radiation Monitoring Applications in Nuclear Reactors

Nabiha Ben Abid Abdalla M. Khattab Hani A.M. Harb Chokri Souani

Random Articles

Reseach Article

Survey on Swarm Search Feature Selection for Big Data Stream Mining

by S. Meera, B. Rosiline Jeetha

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 158 - Number 1

Year of Publication: 2017

Authors: S. Meera, B. Rosiline Jeetha

10.5120/ijca2017912720

S. Meera, B. Rosiline Jeetha . Survey on Swarm Search Feature Selection for Big Data Stream Mining. International Journal of Computer Applications. 158, 1 ( Jan 2017), 11-16. DOI=10.5120/ijca2017912720

@article{ 10.5120/ijca2017912720,

author = { S. Meera, B. Rosiline Jeetha },

title = { Survey on Swarm Search Feature Selection for Big Data Stream Mining },

journal = { International Journal of Computer Applications },

issue_date = { Jan 2017 },

volume = { 158 },

number = { 1 },

month = { Jan },

year = { 2017 },

issn = { 0975-8887 },

pages = { 11-16 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume158/number1/26871-2017912720/ },

doi = { 10.5120/ijca2017912720 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-07T00:03:39.062911+05:30

%A S. Meera

%A B. Rosiline Jeetha

%T Survey on Swarm Search Feature Selection for Big Data Stream Mining

%J International Journal of Computer Applications

%@ 0975-8887

%V 158

%N 1

%P 11-16

%D 2017

%I Foundation of Computer Science (FCS), NY, USA

Abstract

Now days, there are more number of corporations are gathering a more number of information, frequently produced incessantly as a series of measures and approaching from different types of positions. Big data defines a knowledge used to record and execute the data set and it has the structured, semi structured and unstructured data that has to be mined for valuable data. On the other hand, mining through the high dimensional data the search space from which an optimal feature subset is determined and it is enhanced in size, guiding to a difficult stipulate in computation. With respect to handle the troubles, the research work is generally based on the high-dimensionality and streaming structure of data feeds in big data, a new inconsequential feature selection methodology that can be used to identify the feature selection methods in the big data. Some of the research work illustrates the different kinds of optimization methods for data stream mining would lead to tremendous changes in big data. This research work is focused on discussing various research methods that focus on finding the efficient feature selection methods which is used to avoid main challenges and produce optimal solutions. The previous methods are described with their advantages and disadvantages, consequently that the additional research works can be focused more. The tentative experiments were on the entire research works in Mat lab simulation surroundings and it is differentiated with everyone to identify the good methodologies beneath the different performance measures.

References

Alelyani, S., Zhao, Z and Liu, H., 2011. “A dilemma in assessing stability of feature selection algorithms”, in IEEE 13th International Conference on High Performance Computing and Communications (HPCC), 701–707.
Minku, L.L., White A.P and X. Yao, 2010. “The impact of Diversity on online ensemble learning in the presence of concept drift”, 22(5):730–742.
Fong and Simon, 2014. “A Scalable data stream mining methodology: stream-based holistic analytics and reasoning in parallel”, Computational and Business Intelligence (ISCBI), 2014 2nd International Symposium.
Ping-Feng Pai and Tai-Chi Chen, 2009. Rough set theory with discriminant analysis in analyzing electricity loads", Expert Systems with Applications 36:8799–880.
Guyon, I and Elisseeff, A., 2003. “An Introduction to Variable and Feature Selection”, Journal of Machine Learning Research, 3: 1157- 1182.
Chakraborty and Basabi, 2014. “Rough fuzzy consistency measure with evolutionary algorithm for attribute reduction”, 2014 IEEE International Conference on Systems, Man, and Cybernetics (SMC).
Bishop, C.M., 2006. Pattern Recognition and Machine Learning, Springer.
Tani, Fauzia Yasmeen, Dewan Md Farid and Mohammad Zahidur Rahman, 2012. “Ensemble of Decision Tree Classifiers for Mining Web Data Streams”, International Journal of Applied Information Systems, 30-36 .
Akioka and Sayaka, 2013. “Task Graphs of Stream Mining Algorithms”.
Yu, Kui, 2014. “Towards scalable and accurate online feature selection for big data”, 2014 IEEE International Conference on Data Mining.
Tekin, Cem, Luca Canzian and Mihaela Van Der Schaar, 2014. “Context-adaptive big data stream mining”, Communication, Control, and Computing (Allerton).
Ruta and Dymitr, 2014, “Robust method of sparse feature selection for multi-label classification with Naive Bayes”, Computer Science and Information Systems (FedCSIS).
Vu and Anh Thu, 2014. “Distributed adaptive model rules for mining big data streams”, Big Data (Big Data).
Fong and Simon, 2014. "A Scalable data stream mining methodology: stream-based holistic analytics and reasoning in parallel”, Computational and Business Intelligence (ISCBI).
Shivani Harde and Vaishali Sahare, 2015. “ACO Swarm Search Feature Selection for Data stream Mining in Big Data”, International Journal of Innovative Research in Computer and Communication Engineering, 3(12).
Wang and Chanpaul, J., 2015. “A novel initialization method for particle swarm optimization-based FCM in big biomedical data”.
Fong, Simon, Raymond Wong and Athanasios V. Vasilakos, 2016. “Accelerated PSO swarm search feature selection for data stream mining big data”, IEEE Transactions on Services Computing, 33-45.

Index Terms

Computer Science

Information Sciences

Keywords

Big Data Feature Selection Particle Swarm Optimization Classification