• DocumentCode
    1991256
  • Title

    An Intelligent System for Searching Genomic Sequences

  • Author

    Gurnmuluru, V. ; Chen, Su-Shing

  • Author_Institution
    Florida Univ., Gainesville
  • fYear
    2007
  • fDate
    14-17 Oct. 2007
  • Firstpage
    982
  • Lastpage
    986
  • Abstract
    In this paper, we present an intelligent system for comparative search of genomic sequences that departs from traditional alignment of nucleic acid residues or alphabets. Instead, we use the composition vector method, which exploits pattern structures in sequences, together with indexing techniques to build a genomic database of prokaryotic organisms and their phylogenetic relationships. For the structural analysis of prokaryotic patterns, we use the composition vector to express various fuzzy sequence pattern queries on genomic data that would be difficult to represent in traditional database technology. B.L. Hao and his group have used the composition vector method to construct a phylogenetic tree of prokaryotes in order to understand their evolutionary history. The method is based on counting the frequency of nucleotide strings of a fixed length K (K-strings) in the collection of gene sequences of each species, and thereby transforms variable-length sequences into a fixed-length vector. In addition to elaborating on the composition vector method, we describe the sequence pattern queries and the implementation with its reasoning, and conclude with a discussion intended to prompt further thoughts on extending this work.
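The counting step described in the abstract can be illustrated with a minimal sketch. It assumes a plain DNA alphabet (A, C, G, T) and simple relative frequencies; the function name `composition_vector` and the handling of ambiguous bases are hypothetical choices made for illustration, and any further normalization used in the published method is not shown.

```python
from itertools import product

ALPHABET = "ACGT"  # assumption: plain DNA alphabet, no ambiguity codes


def composition_vector(sequences, k):
    """Return relative frequencies of all 4**k K-strings over the given sequences.

    Hypothetical helper for illustration only; it covers the raw counting
    step and nothing else.
    """
    # Fix one position in the vector for every possible K-string,
    # in lexicographic order (AA..., AC..., ..., TT...).
    index = {"".join(p): i for i, p in enumerate(product(ALPHABET, repeat=k))}
    counts = [0] * len(index)
    total = 0
    for seq in sequences:
        seq = seq.upper()
        # Slide a window of width k across the sequence.
        for start in range(len(seq) - k + 1):
            pos = index.get(seq[start:start + k])
            if pos is not None:  # skip windows with symbols outside ALPHABET
                counts[pos] += 1
                total += 1
    if total == 0:
        return [0.0] * len(counts)
    return [c / total for c in counts]


# Two gene sets of different total length map to vectors of the same length.
short_genes = ["ACGT"]
long_genes = ["AAAA", "ACGTACGTAC"]
print(len(composition_vector(short_genes, 2)), len(composition_vector(long_genes, 2)))
```

Because every species is mapped to a vector of the same length (4**K under the assumed alphabet), vectors can be compared directly with an ordinary distance measure, whatever the lengths of the underlying sequences.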
  • Keywords
    biological techniques; cellular biophysics; genetic algorithms; genetics; intelligent networks; molecular biophysics; composition vector method; fuzzy sequence pattern queries; gene sequences; genomic database; genomic sequences; indexing; intelligent system; nucleotides; pattern structures; phylogenetic relationships; prokaryotes phylogenetic tree; prokaryotic organisms; prokaryotic patterns; Bioinformatics; Buildings; Databases; Genomics; History; Indexing; Intelligent systems; Organisms; Pattern analysis; Phylogeny; K-string; composition vector; frequency vector; sequence pattern queries;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Bioinformatics and Bioengineering, 2007. BIBE 2007. Proceedings of the 7th IEEE International Conference on
  • Conference_Location
    Boston, MA
  • Print_ISBN
    978-1-4244-1509-0
  • Type

    conf

  • DOI
    10.1109/BIBE.2007.4375677
  • Filename
    4375677