• DocumentCode
    1617468
  • Title

    A Genome Signature Based on Markov Modeling

  • Author

    Li, Jian ; Sayood, Khalid

  • Author_Institution
    Dept. of Electr. Eng., Nebraska Univ., Lincoln, NE
  • fYear
    2006
  • Firstpage
    2832
  • Lastpage
    2835
  • Abstract
    We propose a "genome signature" for bacterial genomes based on a triplets Markov model. Without the alignment or data preprocessing required by traditional analysis methods, the model is shown to efficiently capture identifying genomic information at genus, species and strain levels. Based on the model, a simple distance measure is proposed for constructing phylogeny trees. Unlike other genome signatures based on word frequency with problems balancing word length and window size, the method has been shown to work successfully with both bacterial whole genome data and individual eukaryotic genes. Applications of the model to phylogenetic analysis and sequence fragment identification are presented
  • Keywords
    Markov processes; biology computing; cellular biophysics; genetics; physiological models; Markov modeling; eukaryotic genes; genome; phylogenetic analysis; phylogeny trees; sequence fragment identification; Bioinformatics; Capacitive sensors; Data preprocessing; Frequency; Genomics; Information analysis; Microorganisms; Organisms; Phylogeny; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Engineering in Medicine and Biology Society, 2005. IEEE-EMBS 2005. 27th Annual International Conference of the
  • Conference_Location
    Shanghai
  • Print_ISBN
    0-7803-8741-4
  • Type

    conf

  • DOI
    10.1109/IEMBS.2005.1617063
  • Filename
    1617063