DocumentCode :
3410595
Title :
Locating all tandem repeat families in a sequence
Author :
Adjeroh, Donald ; Feng, Jianan
Author_Institution :
West Virginia Univ., Morgantown, WV, USA
fYear :
2004
fDate :
16-19 Aug. 2004
Firstpage :
676
Lastpage :
681
Abstract :
We present a new data structure called the BSCP (block sorted common prefix), and its tree representation, called the BSCP tree. We also introduce the notion of PTR family - a biologically motivated description and representation of the tandem repetitions hi a sequence. The PTR family implicitly encodes each distinct primitive tandem repeat in the sequence as its part. Based on the BSCP tree, we describe a method to locate all the primitive tandem repeat families in an input sequence T. The proposed method requires average space and time complexity in O(u), where u = |T|.
Keywords :
biology computing; computational complexity; genetics; molecular biophysics; tree data structures; average space complexity; average time complexity; block sorted common prefix tree; data structure; genomic sequence; primitive tandem repeat families; Bioinformatics; Biological information theory; Computer science; Data structures; Diseases; Engineering profession; Genetics; Genomics; US Department of Energy; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
Print_ISBN :
0-7695-2194-0
Type :
conf
DOI :
10.1109/CSB.2004.1332544
Filename :
1332544
Link To Document :
بازگشت