International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 128 - Number 16 |
Year of Publication: 2015 |
Authors: Munina Yusufu, Gulina Yusufu |
10.5120/ijca2015906752 |
Munina Yusufu, Gulina Yusufu . Efficient Algorithm for Extracting Complete Repeats from Biological Sequences. International Journal of Computer Applications. 128, 16 ( October 2015), 33-37. DOI=10.5120/ijca2015906752
In this paper, an approach for efficiently extracting the repeating patterns in a biological sequence is proposed. A repeating pattern is a subsequence which appears more than once in a sequence, which is one of the most important features that can be used for revealing functional or evolutionary relationships in biological sequences. The algorithm does a rapid scan of the string to find repeating regions where the repeating substring has been marked using length, occurrence positions, and occurrence frequency. The algorithm execute in linear time and space independent of alphabet size. The algorithm also has the capability to restrict output complete repeats in which length (period) p ≥ pmin, where pmin ≥ 1 is a user-specified minimum. The algorithm outputs complete repeats, and can be extended or applied to other situations, for example computing maximal repeats, or finding common motifs in a set of biological sequences.