Title :
Discovery of repetitive patterns in DNA with accurate boundaries
Author :
Zheng, Jie ; Lonardi, Stefano
Author_Institution :
Dept. of Comput. Sci. & Eng., California Univ., Riverside, CA, USA
Abstract :
The accurate identification of repeats remains a challenging open problem in bioinformatics. Most existing methods of repeat identification either depend on annotated repeat databases or restrict repeats to pairs of similar sequences that are maximal in length. The fundamental flaw in most of the available methods is the lack of a definition that correctly balances the importance of a repeat's length and its frequency. In this paper, we propose a new definition of repeats that satisfies both criteria. We give a novel characterization of the building blocks of repeats, called elementary repeats, which leads to a natural definition of repeat boundaries. We design efficient algorithms and test them on synthetic and real biological data. Experimental results show that our method is highly accurate.
Keywords :
DNA; biology computing; molecular biophysics; molecular configurations; DNA sequences; accurate boundaries; annotated repeat databases; bioinformatics; elementary repeats; repeat identification; repetitive DNA patterns; restrict repeats; Biological information theory; Diseases; Frequency; Genetics; Genomics; Humans; Libraries; Sequences
Conference_Title :
Bioinformatics and Bioengineering, 2005. BIBE 2005. Fifth IEEE Symposium on
Print_ISBN :
0-7695-2476-1
DOI :
10.1109/BIBE.2005.23