Title :
PSST... the probabilistic sequence search tool
Author :
Miller, Crispin J. ; Attwood, Teresa K.
Author_Institution :
Dept. of Comput. Sci., Manchester Univ., UK
Abstract :
Whole genome comparison and clustering cannot be routinely performed without access to significant resources. If as expected, repositories continue to grow at the current rate, increasingly large and expensive systems will be required in order to maintain the status quo. The high-proportion of uncharacterised gene-sequences, combined with the fact that the majority of sequence analysis techniques are alignment-based, raises the possibility that alternative approaches might be able to identify relationships that have otherwise been missed. There is a need for alternative ways to predict function. PSST is an analysis tool with parallels to both pairwise algorithms and multiple motif-based pattern approaches. It is significantly faster than BLAST, and for some families including GPCRs, the tool is more sensitive and selective as well. For others it is worse. This paper describes the algorithm, its implementation, its evaluation against a diverse set of protein families, and discusses the reasons behind its behaviour
Keywords :
biology computing; genetics; sequences; PSST; Probabilistic Sequence Search Tool; gene sequences; multiple motif-based pattern approaches; pairwise algorithms; whole genome clustering; whole genome comparison; Algorithm design and analysis; Biochemistry; Bioinformatics; Birth disorders; Computer science; Databases; Genomics; Hip; Proteins; Sequences;
Conference_Titel :
Bioinformatics and Bioengineering Conference, 2001. Proceedings of the IEEE 2nd International Symposium on
Conference_Location :
Bethesda, MD
Print_ISBN :
0-7695-1423-5
DOI :
10.1109/BIBE.2001.974409