DocumentCode :
1550674
Title :
A New Efficient Algorithm for the Gene-Team Problem on General Sequences
Author :
Wang, Biing-Feng ; Kuo, Chung-Chin ; Liu, Shang-Ju ; Lin, Chien-Hsin
Author_Institution :
Dept. of Comput. Sci., Nat. Tsing Hua Univ., Hsinchu, Taiwan
Volume :
9
Issue :
2
fYear :
2012
Firstpage :
330
Lastpage :
344
Abstract :
Identifying conserved gene clusters is an important step toward understanding the evolution of genomes and predicting the functions of genes. A famous model to capture the essential biological features of a conserved gene cluster is called the gene-team model. The problem of finding the gene teams of two general sequences is the focus of this paper. For this problem, He and Goldwasser had an efficient algorithm that requires O(mn) time using O(m + n) working space, where m and n are, respectively, the numbers of genes in the two given sequences. In this paper, a new efficient algorithm is presented. Assume m ≤ n. Let C = ΣαϵΣ o1(α)o2(α), where Σ is the set of distinct genes, and o1(α) and o2(a) are, respectively, the numbers of copies of a in the two given sequences. Our new algorithm requires O(min{C lg n,mn}) time using O(m + n) working space. As compared with He and Goldwasser´s algorithm, our new algorithm is more practical, as C is likely to be much smaller than mn in practice. In addition, our new algorithm is output sensitive. Its running time is O(lg n) times the size of the output. Moreover, our new algorithm can be efficiently extended to find the gene teams of k general sequences in O(k C Ig (n1n2...nk)) time, where ni is the number of genes in the ith input sequence.
Keywords :
biology computing; data structures; genetics; genomics; Goldwasser algorithm; biological features; gene clusters; gene-team problem; general sequences; genomes; genomics; Algorithm design and analysis; Bioinformatics; Biological cells; Biological system modeling; Clustering algorithms; Helium; Manganese; Algorithms; comparative genomics; conserved gene clusters.; data structures; gene teams; Algorithms; Animals; Conserved Sequence; Genome; Humans; Models, Genetic; Multigene Family; Sequence Alignment;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2011.96
Filename :
5871587
Link To Document :
بازگشت