International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 85 - Number 8 |
Year of Publication: 2014 |
Authors: C. Arunkumar, S. Ramakrishnan |
10.5120/14864-3239 |
C. Arunkumar, S. Ramakrishnan . Two Step Feature Extraction Method for Microarray Cancer Data using Support Vector Machines. International Journal of Computer Applications. 85, 8 ( January 2014), 34-42. DOI=10.5120/14864-3239
Diagnosis of cancer is one of the most emerging clinical applications in microarray gene expression data. However, cancer classification on microarray gene expression data still remains a difficult problem. The main reason for this is the significantly large number of genes present relatively compared to the number of available training samples. In this paper, a novel approach to feature extraction combining the statistical t-test and absolute scoring is proposed for achieving better classification rate. Suitable classification approaches using the linear Support Vector Machines, the Proximal Support Vector Machines and the Newton Support Vector Machines is also discussed. A comparative analysis on the different techniques for feature extraction is also presented. Microarray cancer data based on Adenoma and Carcinoma with 7086 and 7457 genes of 4 and 18 patients respectively is used for this study. Increase in the classification rate of the proposed new method is clearly demonstrated in the results.