DocumentCode
1991256
Title
An Intelligent System for Searching Genomic Sequences
Author
Gurnmuluru, V. ; Chen, Su-Shing
Author_Institution
Floirda Univ., Gainesville
fYear
2007
fDate
14-17 Oct. 2007
Firstpage
982
Lastpage
986
Abstract
In this paper, we have developed an intelligent system for searching comparative genomic sequences which departs from the traditional sequence alignment methods of nucleic residues or alphabets. Instead, we use the composition vector method that exploits pattern structures in sequences and indexing techniques for building a genomic database of prokaryotic organisms and their phylogenetic relationships. For the structural analysis of prokaryotic patterns, we use this composition vector to express various fuzzy sequence pattern queries on genomic data that would be difficult to represent in traditional database technology. B.L. Hao and his group have used the composition vector method to construct a phylogenetic tree of prokaryotes to understand the evolutionary history of prokaryotic organisms. The composition vector method is based on counting the frequency of nucleotides of a fixed length K in the collection of gene sequences of each species. This method transforms variable length sequences to a fixed length vector. In addition to elaborating on the composition vector method, we also dwell on the sequence pattern queries, the implementation with its reasoning before we finally wrap up with a discussion which we are sure will kindle some more thoughts and views to progress this work.
Keywords
biological techniques; cellular biophysics; genetic algorithms; genetics; intelligent networks; molecular biophysics; composition vector method; fuzzy sequence pattern queries; gene sequences; genomic database; genomic sequences; indexing; intelligent system; nucleotides; pattern structures; phylogenetic relationships; prokaryotes phylogenetic tree; prokaryotic organisms; prokaryotic patterns; Bioinformatics; Buildings; Databases; Genomics; History; Indexing; Intelligent systems; Organisms; Pattern analysis; Phylogeny; K-string; composition vector; frequency vector; sequence pattern queries;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Bioengineering, 2007. BIBE 2007. Proceedings of the 7th IEEE International Conference on
Conference_Location
Boston, MA
Print_ISBN
978-1-4244-1509-0
Type
conf
DOI
10.1109/BIBE.2007.4375677
Filename
4375677
Link To Document