International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 146 - Number 9 |
Year of Publication: 2016 |
Authors: Pamela Vinitha Eric, Kusum Rajput, Gopakumar G. |
10.5120/ijca2016910851 |
Pamela Vinitha Eric, Kusum Rajput, Gopakumar G. . An Improved Method to Identify Exact and Approximate Tandem Repeats in DNA Sequences using Biclustering. International Journal of Computer Applications. 146, 9 ( Jul 2016), 1-5. DOI=10.5120/ijca2016910851
Tandem repeats occur frequently in eukaryotic and prokaryotic genomic sequences. They are associated with several inherited human diseases, DNA fingerprinting, evolution and regulatory processes. In spite of their importance, detection of tandem repeats is still not resolved in the sense that the current existing detection tools do not give the same results for a given input sequence. This is mainly due to the differences in the methods adopted by the search algorithms and the different parameter settings needed when they are executed. This paper proposes an efficient method to identify all exact and approximate tandem repeats within a given DNA sequence and also identifies the presence of any changes brought about by mutation. The method first identifies all potential tandem repeats by clustering using K-means method, followed by biclustering to filter out the actual repeats along with the position of occurrance of approximate tandem repeats. The results obtained by this method are consistent with that of existing methods.