CFP last date
20 December 2024
Reseach Article

A Survey on Stable Feature Selection for Micro Array Data

Published on February 2013 by G. Baskar, P. Ponmuthuramalingam
International Conference on Research Trends in Computer Technologies 2013
Foundation of Computer Science USA
ICRTCT - Number 3
February 2013
Authors: G. Baskar, P. Ponmuthuramalingam
a59dcf5a-bbeb-4aba-b706-dd030d31b429

G. Baskar, P. Ponmuthuramalingam . A Survey on Stable Feature Selection for Micro Array Data. International Conference on Research Trends in Computer Technologies 2013. ICRTCT, 3 (February 2013), 10-13.

@article{
author = { G. Baskar, P. Ponmuthuramalingam },
title = { A Survey on Stable Feature Selection for Micro Array Data },
journal = { International Conference on Research Trends in Computer Technologies 2013 },
issue_date = { February 2013 },
volume = { ICRTCT },
number = { 3 },
month = { February },
year = { 2013 },
issn = 0975-8887,
pages = { 10-13 },
numpages = 4,
url = { /proceedings/icrtct/number3/10817-1030/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 International Conference on Research Trends in Computer Technologies 2013
%A G. Baskar
%A P. Ponmuthuramalingam
%T A Survey on Stable Feature Selection for Micro Array Data
%J International Conference on Research Trends in Computer Technologies 2013
%@ 0975-8887
%V ICRTCT
%N 3
%P 10-13
%D 2013
%I International Journal of Computer Applications
Abstract

Feature selection has recently attracted strong interest in knowledge discovery from high-dimensional data. Classification is a data mining (machine learning) technique used to predict group membership for data instances, microarray is the technology which allows researchers to gather information on various gene expression all at the same time and this techniques have been applied in many computer application. Gene selection for cancer classification is one of the most important topics in biomedical field. In this survey a common microarray classification techniques based on data mining methodology for perform both accuracy and stability measurement.

References
  1. John, G. H. , Kohavi, R. , Pfleger, K. : Irrelevant Feature and The Subset Selection Problem. In: Proceedings of the Eleventh International Conference on Machine Learning, pp. 121–129 (1994)
  2. Liu, H. , Yu, L. : Toward Integrating Feature Selection Algorithms for Classification and Clustering. IEEE Transactions on Knowledge and Data Engineering 17(4), 491– 502 (2005)
  3. Ding, C. , Peng, H. : Minimum Redundancy Feature Selection from Microarray Gene Expression Data. In: Proceedings of the Computational Systems Bioinformatics conference (CSB 2003), pp. 523–529 (2003)
  4. Yu, L. , Liu, H. : Efficient Feature Selection via Analysis of Relevance and Redundancy. Journal of Machine Learning Research 5, 1205–1224 (2004)
  5. Pepe, M. S. , Etzioni, R. , Feng, Z. , et al. : Phases of Biomarker Development for Early Detection of Cancer. J. Natl. Cancer Inst. 93, 1054–1060 (2001)
  6. T. Li, C. Zhang, and M. Ogihara, "A comparative study of feature selection and multiclass classification methods for tissue classification based on gene expression," Bioinformatics, vol. 20, pp. 2429–2437, 2004
  7. H. Liu and L. Yu, "Toward integrating feature selection algorithms for classification and clustering," IEEE Transactionson Knowledge and Data Engineering (TKDE), vol. 17, no. 4,pp. 491–502, 2005.
  8. M. Wasikowski and X. Chen, "Combating the small sample class imbalance problem using feature selection," IEEE Transactions on Knowledge and Data Engineering, vol. 22,no. 10, pp. 1388–1400, 2010.
  9. C. A. Davis, F. Gerick, V. Hintermair, et al. , "Reliablegene signatures for microarray classification: assessment ofstability and performance," Bioinformatics, vol. 22, pp. 2356– 2363, 2006.
  10. C. A. Davis, F. Gerick, V. Hintermair, et al. , "Reliable gene signatures for microarray classification: assessment of stability and performance," Bioinformatics, vol. 22, pp. 2356– 2363, 2006.
  11. T. R. Golub, D. K. Slonim, P. Tamayo, C. Huard, M. Gaasenbeek, J. P. Mesirov, H. Coller, M. L. Loh, J. R. Downing, M. A. Caligiuri, C. D. Bloomfield, and E. S. Lander, "Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring," Science, vol. 286, pp. 531-537, 1999.
  12. T. Li, C. Zhang, and M. Ogihara, "A Comparative Study of Feature Selection and Multiclass Classification Methods for Tissue Classification Based on Gene Expression," Bioinformatics, vol. 20, pp. 2429-2437, 2004.
  13. Y. Saeys, I. Inza, and P. Larranaga, "A Review of Feature Selection Techniques in Bioinformatics," Bioinformatics, vol. 23, no. 19, pp. 2507-2517, 2007.
  14. H. Liu, J. Li, and L. Wong, "A Comparative Study on Feature Selection and Classification Methods Using Gene Expression Profiles and Proteomic Patterns," Genome Informatics, vol. 13, pp. 51-60, 2002
  15. P. A. Mundra and J. C. Rajapakse, "SVM-RFE with MRMR Filter for Gene Selection," IEEE Trans. Nano Bioscience, vol. 9, no. 1, pp. 31- 37, Mar. 2010
  16. I. H. Witten and E. Frank, Data Mining - Practical Machine Learning Tools and Techniques. Morgan Kaufmann Publishers, 2005.
  17. B. Y. Rubinstein, Simulation and the Monte Carlo Method. John Wiley & Sons, 1981
  18. Y. Tang, Y. Q. Zhang, and Z. Huang, "Development Two-Stage SVM-RFE Gene Selection Strategy for Microarray Expression Data Analysis," IEEE/ACM Trans. Computational Biology and Bioinformatics, vol. 4, no. 3, pp. 365- 381, July 2007.
  19. Pawan Lingras, Chad West. Interval set Clustering of Web users with Rough k-Means, submitted to the Journal of Intelligent Information System in 2002.
  20. Yeung K. Y, Haynor D. R, Ruzzo W. L. Validating clustering for gene expression data. Bioinformatics. 2001
Index Terms

Computer Science
Information Sciences

Keywords

Data Mining Feature Selection Stability Microarray Classification