CFP last date
20 January 2025
Reseach Article

Artificial Neural Network based String Matching Algorithms for Species Classification – A Preliminary Study and Experimental Results

by Sathish Kumar S, N. Duraipandian
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 52 - Number 14
Year of Publication: 2012
Authors: Sathish Kumar S, N. Duraipandian
10.5120/8270-1832

Sathish Kumar S, N. Duraipandian . Artificial Neural Network based String Matching Algorithms for Species Classification – A Preliminary Study and Experimental Results. International Journal of Computer Applications. 52, 14 ( August 2012), 21-29. DOI=10.5120/8270-1832

@article{ 10.5120/8270-1832,
author = { Sathish Kumar S, N. Duraipandian },
title = { Artificial Neural Network based String Matching Algorithms for Species Classification – A Preliminary Study and Experimental Results },
journal = { International Journal of Computer Applications },
issue_date = { August 2012 },
volume = { 52 },
number = { 14 },
month = { August },
year = { 2012 },
issn = { 0975-8887 },
pages = { 21-29 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume52/number14/8270-1832/ },
doi = { 10.5120/8270-1832 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:52:14.092767+05:30
%A Sathish Kumar S
%A N. Duraipandian
%T Artificial Neural Network based String Matching Algorithms for Species Classification – A Preliminary Study and Experimental Results
%J International Journal of Computer Applications
%@ 0975-8887
%V 52
%N 14
%P 21-29
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The preliminary research in the area of applications of neural networks and pattern matching algorithms in species classification is presented. Artificial neural networks for classification and different pattern matching algorithms for matching the given DNA patterns or strings with the existing DNA sequences available in the databases are specifically studied. A set of local searching algorithms were experimented for different test string lengths and their time complexity is tabulated. Conclusions and future directions are also presented.

References
  1. The website and Glossary maintained by Amateur Entomologists Society
  2. Zoheir Ezziane, Applications of artificial intelligence in bioinformatics: A review, Expert Systems with Applications 30 (2006) 2–10
  3. Jiawei Han and Micheline Kamber, Data Mining: Concepts and Techniques, Second Edition, Morgan Kaufmann publications, 2006.
  4. Arthur Lesk, Introduction to Bioinformatics, Oxford University Press, USA, 3 Edition, 2008.
  5. Simon Haykin, Neural Networks a Comprehensive Foundation, Prentice Hall Publications, II Edition, 1998.
  6. NeuralWare (2003), NeuralWorks Predict® Getting Started Guide for Windows, Carnegie, PA.
  7. Dr Yeshpal Singh and Alok Singh Chauhan, "Neural Networks in Data Mining" Journal of Theoretical and Applied Information Technology, 2005 - 2009
  8. Kwong-Sak Leung, Ka-Chun Wong, Tak-Ming Chan, Man-Hon Wong, Kin-Hong Lee, Chi-Kong Lau, and Stephen K. W. Tsui, . Discovering protein–DNA binding sequence patterns using association rule mining, Nucleic Acids Res. 2010 October; 38(19): 6324–6337. Published online 2010 June 6.
  9. Felix Autenrieth, Barry Isralewitz, Zaida Luthey-Schulten, Anurag Sethi, Taras Pogorelov, Bioinformatics and Sequence Alignment, June 2005
  10. Thompson JD, Plewniak F, Poch O. (1999). "A comprehensive comparison of multiple sequence alignment programs". Nucleic Acids Res 27 (13): 2682–90.
  11. Boyer, Robert S. ; Moore, J Strother (October 1977). "A Fast String Searching Algorithm. ". Comm. ACM (New York, NY, USA: Association for Computing Machinery) 20 (10): 762–772.
  12. CROCHEMORE, M. , HANCART, C. , 1999, Pattern Matching in Strings, in Algorithms and Theory of Computation Handbook, M. J. Atallah ed. , chapter 11, pp 11-1--11-28, CRC Press Inc. , Boca Raton, FL
  13. HORSPOOL R. N. , 1980, Practical fast searching in strings, Software - Practice & Experience, 10(6):501-506.
  14. Evolutionary Computation 2 - Advanced Algorithms and Operations, edited by Thomas Baeck, D. B Fogel, Z Michalewicz, Taylor & Francis; I edition, November 2000
  15. Timothy Masters, Advanced algorithms for neural networks: a C++ sourcebook, Volume 1, Wiley, I edition, Apr-1995
  16. Kiran P, S Sathish Kumar and Dr Kavya N P, A Novel Framework using Elliptic Curve Cryptography for Extremely Secure Transmission in Distributed Privacy Preserving Data Mining, Advanced Computing: An International Journal ( ACIJ ), Vol. 3, No. 2, March 2012
  17. Jianbo Gao, Yan Qi, Yinhe Cao, and Wen-wen Tung, "Protein Coding Sequence Identification by Simultaneously Characterizing the Periodic and Random Features of DNA Sequences", Journal of Biomedicine and Biotechnology, Vol. 2, pp. 139–146, 2005.
  18. Shital Shah and Andrew Kusiak, "Cancer gene search with data-mining and genetic algorithms, Computers in Biology and Medicine", Vol. 37, No. 2, pp. 251-261, February 2007
  19. Riccardo Bellazzi and Blaz Zupan, "Towards knowledge-based gene expression data mining", Journal of Biomedical Informatics, Vol. 40, No. 6, pp. 787-802, December 2007
  20. Fayyad, Piatetsky-Shapiro, Smyth and Uthurusamy, "Advances in knowledge discovery and data mining", AAAI/MIT Press, 1995
  21. Kiran P, Sathish Kumar S and Dr Kavya N P, An Extended Conceptual Modelling for ETL Processes in Privacy Preserving Data Mining, International Conference on Computing and Computer Vision (ICCCV 2012).
  22. Jianbo Gao, Yan Qi, Yinhe Cao, and Wen-wen Tung, "Protein Coding Sequence Identification by Simultaneously Characterizing the Periodic and Random Features of DNA Sequences", Journal of Biomedicine and Biotechnology, Vol. 2, pp. 139–146, 2005.
  23. Cormen, Thomas H. ; Leiserson, Charles E. ; Rivest, Ronald L. ; Stein, Clifford (2001-09-01). "The Rabin–Karp algorithm". Introduction (2nd ed. ). Cambridge, Massachusetts: MIT Press. pp. 911–916.
  24. Aho, Alfred V. ; Margaret J. Corasick (June 1975). "Efficient string matching: An aid to bibliographic search". Communications of the ACM 18 (6): 333–340
  25. CROCHEMORE, M. , LECROQ, T. , 1996, Pattern matching and text compression algorithms, in CRC Computer Science and Engineering Handbook, A. Tucker ed. , Chapter 8, pp 162-202, CRC Press Inc. , Boca Raton, FL.
  26. NAVARRO G. , RAFFINOT M. , 1998. A Bit-Parallel Approach to Suffix Automata: Fast Extended String Matching, In Proceedings of the 9th Annual Symposium on Combinatorial Pattern Matching, Lecture Notes in Computer Science 1448, Springer-Verlag, Berlin, 14-31.
  27. Needleman, Saul B. ; and Wunsch, Christian D. (1970). "A general method applicable to the search for similarities in the amino acid sequence of two proteins". Journal of Molecular Biology 48 (3): 443–53 28. http://www. cs. tau. ac. il/~rshamir/algmb/98/scribe/html/ lec02/node10. html
  28. P. D. Michailidis and K. G. Margaritis On-line String Matching Algorithms: Survey and Experimental Results, International Journal of Computer Mathematics, Vol. 76, No. 4. (2001), pp. 411-434
  29. E. W. T. Ngai, Yong Hu, Y. H. Wong, Yijun Chen, Xin Sun, "The application of datamining techniques in financial fraud detection: A classification framework and an academic review of literature" Decision Support Systems, Volume 50, Issue 3, February 2011, Pages 559–569
  30. Arzu ?encan ?ahin, ?smail ?lke Köse & Re?at Selba, "Comparative analysis of neural network and neuro – fuzzy system for thermodynamic properties of refrigerants" Applied Artificial Intelligence: An International Journal, Volume 26, Issue 7, 2012, DOI: 10. 1080/08839514. 2012. 701427
Index Terms

Computer Science
Information Sciences

Keywords

Alignment ANN DNA sequencing Species classification String matching