CFP last date
20 December 2024
Reseach Article

A Framework for Feature Selection using Data Value Metric and Genetic Algorithm

by Ojie Deborah Voke, Akazue Maureen, Imianvan Anthony
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 184 - Number 43
Year of Publication: 2023
Authors: Ojie Deborah Voke, Akazue Maureen, Imianvan Anthony
10.5120/ijca2023922533

Ojie Deborah Voke, Akazue Maureen, Imianvan Anthony . A Framework for Feature Selection using Data Value Metric and Genetic Algorithm. International Journal of Computer Applications. 184, 43 ( Jan 2023), 14-21. DOI=10.5120/ijca2023922533

@article{ 10.5120/ijca2023922533,
author = { Ojie Deborah Voke, Akazue Maureen, Imianvan Anthony },
title = { A Framework for Feature Selection using Data Value Metric and Genetic Algorithm },
journal = { International Journal of Computer Applications },
issue_date = { Jan 2023 },
volume = { 184 },
number = { 43 },
month = { Jan },
year = { 2023 },
issn = { 0975-8887 },
pages = { 14-21 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume184/number43/32596-2023922533/ },
doi = { 10.5120/ijca2023922533 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T01:23:52.462824+05:30
%A Ojie Deborah Voke
%A Akazue Maureen
%A Imianvan Anthony
%T A Framework for Feature Selection using Data Value Metric and Genetic Algorithm
%J International Journal of Computer Applications
%@ 0975-8887
%V 184
%N 43
%P 14-21
%D 2023
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Most organizations analyse input data to develop an accurate description or model using the features present in the data. There have been huge amount of generated data in the big data ecosystem which demand better and efficient ways to distil high utility or value from it so as to compliment decision makers in making recommendations and decisions. Before applying classification algorithm, relevant features are selected by a suitable feature selection algorithm. Data Value Metric (DVM) is an information theoretic measure based on the notion of mutual information which has been shown to be a good metric to validate the quality and utility of data in a big data ecosystem and in traditional data. Data Value Metric (DVM) suffers from local minima and loss of diversity in the population since it is using forward selection search strategy, however, hybridizing it with Genetic Algorithm is hoped to overcome the problem of local minima as there would be a blend of evolutionary search to ensure a balance between exploration and exploitation of the search space. This paper proposed the hybrid model of Genetic Algorithm and Data Value Metric (DVM) as an information theoretic metric for quantifying the quality and utility for feature selection which can be applied to traditional data.

References
  1. LeskovecJ., Rajaraman A. and David J. (2014),. Dimensionality Reduction. in Mining of Massive Datasets, New York, Cambridge University Press, pp. 415-447.
  2. Chandrashekar G. and Sahin F.(2014). A survey on feature selection methods. Computers and Electrical Engineering, vol. 40, no. 1, pp. 16-28, 2014.
  3. Lin C. H., ChenH. Y. and. WuY. S (2014). Study of image retrieval and classification based on adaptive features using genetic algorithm feature selection," Expert Systems with Applications, vol. 41, no. 15, pp. 6611-6621.
  4. Lu J., Zhao T. and ZhangY., (2008). Feature selection based-on genetic algorithm for imageannotation," Knowledge-Based Systems, vol. 21, no. 8, pp. 887-891.
  5. Kushwaha P. And Welekar R.(2016). Feature Selection for Image Retrieval based on Genetic Algorithm. International Journal of Interactive Multimedia and Artificial Intelligence, vol. 4, no. 16, pp. 16-21.
  6. Huang H., WuY., ChanY. and LinC.(2010). Study on image feature selection: A genetic algorithm approach," in IET International Conference on Frontier Computing. Theory, Technologies and Applications, Taichung.
  7. Gómez F. and Quesada A.(2017). Genetic algorithms for feature selection in Data Analytics.[Online]. Available: https://www.neuraldesigner.com/blog/genetic_algorithms_for_ feature _selection. [Accessed 05 12 2021].
  8. Sivasankar E. and Rajesh R. S. (2012). Design and development of efficient feature Selection and classification techniques for Clinical decision support system," Shodhganga, Tirunalveli.
  9. LiangL., PengJ. and YangB.(2012). Image Feature Selection Based on Genetic Algorithm. In International Conference on Information Engineering and Applications, Chongqing
  10. SahinerB., ChanH., WeiD., Petrick N., Helvie M. A., Adler D. D. and Goodsitt M. (1996).Image feature selection by a genetic algorithm: Application to classification of mass and Normal breast tissue. Medical Physics, vol. 23, no. 10, pp. 1671-1684.
  11. M. Mitchell, An Introduction to Genetic Algorithms, Cambridge, MA: MIT Press, 1998.
  12. Noshed M, Choi, sun y, hero, A and Iyo, D (20201)a dta value metric for quantifying information . Retrieved 12/8/22, at the journal of bigdata. Springeropen .com>...
  13. S. Khuri, Genetic Algorithms, Helsinki, 2017.
  14. Adam Woznica, Phong Nguyen, Alexandros Kalousis (2012) .Model mining for robust feature selection., KDD '12 Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining, ACM New York, NY, USA, PP 913-921, 2012.
  15. Yu Land LiuH.(2004). Efficient Feature Selection via Analysis of Relevance and Redundancy,. J. Mach. Learn. Res., vol. 5, pp.1205–1224.
  16. Sourabh Katoch, Sumit Singh Chauhan and Vogay Kumar(2021). A review on genetic algorithm : Past, Present and Future. Multimedia tools and Applications. 80:8091-8026.
Index Terms

Computer Science
Information Sciences

Keywords

Genetic Algorithm Data Value Metric feature selection search local minima.