CFP last date
20 December 2024
Reseach Article

An Evaluation of Feature Selection Methods for Multiclass Learning in Bio Informatics

by Megha Purohit, Pooja Mehta
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 138 - Number 6
Year of Publication: 2016
Authors: Megha Purohit, Pooja Mehta
10.5120/ijca2016908855

Megha Purohit, Pooja Mehta . An Evaluation of Feature Selection Methods for Multiclass Learning in Bio Informatics. International Journal of Computer Applications. 138, 6 ( March 2016), 24-27. DOI=10.5120/ijca2016908855

@article{ 10.5120/ijca2016908855,
author = { Megha Purohit, Pooja Mehta },
title = { An Evaluation of Feature Selection Methods for Multiclass Learning in Bio Informatics },
journal = { International Journal of Computer Applications },
issue_date = { March 2016 },
volume = { 138 },
number = { 6 },
month = { March },
year = { 2016 },
issn = { 0975-8887 },
pages = { 24-27 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume138/number6/24385-2016908855/ },
doi = { 10.5120/ijca2016908855 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:38:58.202280+05:30
%A Megha Purohit
%A Pooja Mehta
%T An Evaluation of Feature Selection Methods for Multiclass Learning in Bio Informatics
%J International Journal of Computer Applications
%@ 0975-8887
%V 138
%N 6
%P 24-27
%D 2016
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Traditional data mining techniques such as classification or clustering have demonstrated achievement in datasets which has multiple instances in singly relation but while extreme point of dimensionality or complex dependencies presents in the data it fails to offer accuracy and correctness. In solution to this, Feature (attribute/variable) selection techniques since last two decades have verified its requisites to improve speed, prediction and reduce computational cost of machine learners. In this paper review of assorted feature selection methods named filter, wrapper and embedded with each classifier like support vector machines (SVM), averaged perceptron and neural network is presented. Additionally it conveys an assessment of which FS approach works better for which classifier for breast cancer dataset.

References
  1. [Delen D, Walker G, Kadam A, 2005]. “Predicting breast cancer survivability: a comparison of three data mining methods”, Artif IntellMed, 34:113–27.
  2. [J. Hammon, November 2013]. “Optimization combinatoire pour la sélection de variables en régression en grande dimension”: Application en génétique animale.
  3. [RandallWald, Taghi M. Khoshgoftaar]. “Optimizing Wrapper-Based Feature Selection for Use on Bioinformatics Data”, Amri Napolitano Florida Atlantic University.
  4. [T. M. Phuong, Z. Lin et R. B. Altman, 2005]. “Choosing SNPs using feature selection. Proceeding, IEEE Computational Systems Bioinformatics Conference, pages 301-309.
  5. [B. Duval, J.-K. Hao et J. C. Hernandez Hernandez, 2009]. “A memetic algorithm for gene selection and molecular classification of an cancer”. In Proceedings of the 11th Annual conference on Genetic and evolutionary computation, GECCO '09, pages 201-208, New York, NY, USA.
  6. [Yvan Saeys, Iñaki Inza and Pedro Larrañaga]. “Review of feature selection techniques in bioinformatics”.
  7. [Huihui Zhao, Jianxin Chen, Y.Liu, Qi Shi, Yi Yang, Chenglong Zheng, 2011]. “The use of feature selection based data mining methods in biomarkers identification of disease”, Elsevier, Beijing University of Chinese Medicine, China.
  8. Liu, Cutler G, Li W, Pan Z, and Peng S. "Multiclass cancer classification and biomarker discovery using GA-based algorithms." Bioinformatics 21(11) (June 2005): 2691-2697.
  9. Zhang, Min-Ling, José M. Peña, and Victor Robles. "Feature selection for multi-label naive Bayes classification." Information Sciences 179, no. 19 (2009): 3218–3229
  10. Dietterich, Thomas G., Richard H. Lathrop, and Tomás Lozano-Pérez. "Solving the multiple instance problem with axis-parallel rectangles." Artificial Intelligence 89, no. 1-2 (1997): 31–71
  11. Miller, A., B. Blott and T. Hames, 1992. Review of neural network applications in medical imaging and signal processing. Med. Biol. Engg. Comp., 30: 449-464
  12. Zaiane, Osmar R, Antonie Maria-luiza and A. Coman, 2001. Application of data mining techniques for medical image classification. Second Intl. Workshop on Multimedia Data Mining. In conjunction with ACM SIGKDD Conf. San Francisco, USA, Aug. 26
  13. Multiple Classifier Systems Lecture Notes in Computer Science Volume 3541, 2005, pp 278-285. Which Is the Best Multiclass SVM Method? An Empirical Study Kai-Bo Duan, S. Sathiya Keerthi
  14. F. Aiolli and A. Sperduti. An efficient SMO-like algorithm for multiclass SVM. In Proceedings of IEEE workshop on Neural Networks for Signal Processing, pages 297–306, 2002
  15. Weston, J. and Watkins, C., 1998, Multi-class Support Vector Machines. Royal Holloway, University of London, U. K., Technical Report CSD-TR-98-04
  16. Hsu, C.-W., and C.-J. Lin, C.-J., 2002, A comparison of methods for multi-class support vector machines, IEEE Transactions on Neural Networks, 13, 415-425.JAMES, G., 1998
  17. http://link.springer.com/article/10.1007/s10462-009-9114-9 A review on the combination of binary classifiers in multiclass problems
  18. X. Chen, X. Zeng, and D. van Alphen. Multi-class feature selection for texture classification. Pattern Recognition Letters, 27(14):1685{1691, 2006
  19. G. Madazrov and D. Gjorgjevikj. Evaluation of distance measures for multi-class classification in binary svm decision tree. In Artificial Intelligence and Soft Computing: 10th International Conference, (ICAISC), 2010.
  20. A.C.Lorena, A.C.Carvalho, J.M.Gama, are view on the combination of binary classifiers in multi-class problems, Artificial Intelligence Review30(1–4) (2008)19–37.
  21. L. J. van’t Veer, H. Dai, and M. J. van de Vijver, Gene expression profiling predicts clinical outcome of breast cancer 2002. Nature, 415:530–536
  22. Dataset:http://www.rii.com/publications/2002/vantveer.htm
Index Terms

Computer Science
Information Sciences

Keywords

Machine Learning Multi class classification Feature Selection