CFP last date
20 December 2024
Reseach Article

Enhanced SVM based Ensemble Algorithm to Improve the Classification for High Dimensional Data

by Kavitha S., M. Hemalatha
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 132 - Number 5
Year of Publication: 2015
Authors: Kavitha S., M. Hemalatha
10.5120/ijca2015907340

Kavitha S., M. Hemalatha . Enhanced SVM based Ensemble Algorithm to Improve the Classification for High Dimensional Data. International Journal of Computer Applications. 132, 5 ( December 2015), 12-15. DOI=10.5120/ijca2015907340

@article{ 10.5120/ijca2015907340,
author = { Kavitha S., M. Hemalatha },
title = { Enhanced SVM based Ensemble Algorithm to Improve the Classification for High Dimensional Data },
journal = { International Journal of Computer Applications },
issue_date = { December 2015 },
volume = { 132 },
number = { 5 },
month = { December },
year = { 2015 },
issn = { 0975-8887 },
pages = { 12-15 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume132/number5/23589-2015907340/ },
doi = { 10.5120/ijca2015907340 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:28:20.392525+05:30
%A Kavitha S.
%A M. Hemalatha
%T Enhanced SVM based Ensemble Algorithm to Improve the Classification for High Dimensional Data
%J International Journal of Computer Applications
%@ 0975-8887
%V 132
%N 5
%P 12-15
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Microarrays are novel biotechnological technology that is being used widely in cancer research. By allowing the monitoring of expression levels in cells for thousands of genes simultaneously, microarray experiments may lead to a more complete understanding of cell’s function. This is due to the fact that the physiology of an organism is generally associated with changes in gene expression patterns, thus leading to a finer and more reliable classification. Microarray data is an arrangement of points in rows and columns. Out of the various techniques of data mining, classification and clustering are two processes that have great potential in microarray data analysis. This research work focuses on using machine learning classification algorithms for predicting the presence or absence of cancer. A classification model for microarray data analysis consists of three major steps, namely, preprocessing, gene selection and identification or prediction of genetic defect. The preprocessing step consists of cleaning algorithms like normalization, missing value handling routines which enhance the quality of the gene microarray data and help to improve the subsequent steps. Gene selection is a process where a set of informative genes is selected from the gene expression data in a form of microarray dataset.This process helps improve the performance of the classifier. The third step, classification, is a process to classify microarray data into several predefined classes that have its own characteristics.

References
  1. Alshamlan, H.M., Badr, G.H. and Alohali, Y. (2013) A study of cancer microarray gene expression profile: Objectives and approaches, Proceedings of the World Congress on Engineering, Vol. II, Pp. 1-6.
  2. Alshamlan, H.M.., Badr, G. and Alohali, Y. (2015) mRMR-ABC: A
  3. Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling, BioMed Research International, Vol. 2015, Article ID 604910, Pp. 1-15.
  4. Bolon-Canedo, V., Sanchez-Marono, N. and Betanzos, A.A. (2010) An ensemble of filters and classifiers for microarray data classification, Pattern Recognition, Vol. 45, Pp. 531-539.
  5. Boulesteix, A., Porzelius, C. and Daumer. M. (2008) Microarray-based classification and clinical predictors. Bioinformatics, Vol. 24, No. 15, Pp. 1698-1706.
  6. Chen, C.K. (2012) The classification of cancer stage microarray data, Computer Methods and Programs in Biomedicine, Vol. 108, Issue 3, Pp. 1070-1077.
  7. De Paz, J.F., Bajo, J., Vera, V., Corchado, J.M. (2011) MicroCBR: A case-based reasoning architecture for the classification of microarray data, Applied Soft Computing, Vol. 11, Issue 8, Pp. 4496-4507.
  8. Garcia-Laencina, P.J., Sancho-Gomex, J.L., Figueiras-Vidal, A.R. and Verleysoen, M. (2009) K-nearest Neighbours and mutual information for simulataneous classification and missing data imputation, Neurocomputing, Elsevier, vol. 72., pp. 1483-1493.
  9. Hassan, M.M., Dowd, A.A., Ibrahim, F.I., Mohamed, A.H., Kaheel, H.H. and Hassan, M.A.,(2014) In silico analysis of single nucleotide polymorphisms (SNPs) in human HLA-A and HLA-B genes responsible for renal transplantation rejection, European Academic Research, Vol. II, Issue 3, Pp.3627-3646.
  10. Huang, G.B., Zhu, Q.Y. and Siew, C.K. (2006) Extreme learning machine: Theory and applications, Neurocomputing, Vol. 70, Pp. 489–501.
  11. Lin, D., Shkedy, Z., Yekutieli, D., Amaratunga, D. and Bijnens, L. (2012) Modeling Dose-Response Microarray Data in Early Drug Development Experiments Using R: Order-Restricted Analysis of Microarray Data, Springer Publishing Company Incorporated.
  12. Nguyen, T., Khosravi, A., Creighton, D., & Nahavandi, S. (2015). Hierarchical Gene Selection and Genetic Fuzzy System for Cancer Microarray Data Classification. PLoS ONE, 10(3), e0120364.
  13. Revathi, T. and Sumathi, P. (2014) A Successive Feature Selection Algorithm for Gene Ranking, International Journal of Advanced Research in Computer and Communication Engineering, Vol. 3, Issue 2, Pp. 5744-5748.
  14. Sharma, L.S. (2010) Commerce education in north-east India, (Singh, R.A. Ed.), The role of information technology in Commerce, Chapter 9, Mittal Publications, Pp. 125-134.
Index Terms

Computer Science
Information Sciences

Keywords

SVM Microarraydata classification Gene Selection