• DocumentCode
    1855525
  • Title
    A genome signature based on Markov modeling
  • Author
    Li, Jian ; Sayood, Khalid
  • Author_Institution
    Dept. of Electr. Eng., Nebraska Univ., Lincoln, NE
  • fYear
    2005
  • fDate
    22-25 May 2005
  • Lastpage
    6
  • Abstract
    We propose a "genome signature" for bacterial genomes based on a triplets Markov model. Without the alignment or data preprocessing required by traditional analysis methods, the model is shown to efficiently capture identifying genomic information at both species and strain levels. Based on the model, a simple assumption-free distance measure is proposed for constructing phylogeny trees. The approach avoids problems with word frequency approaches such as balancing word length and window size. The method is shown to work successfully with both bacterial whole genome data and individual eukaryotic genes. Application of the model to phylogenetic analysis is presented.
  • Keywords
    Markov processes; genetics; microorganisms; trees (mathematics); bacterial genomes; bacterial whole genome data; distance measure; eukaryotic genes; genome signature; genomic information; phylogenetic analysis; phylogeny trees; triplets Markov model; Bioinformatics; Capacitive sensors; DNA; Frequency; Genomics; Microorganisms; Organisms; Phylogeny; Proteins; Sequences
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Electro Information Technology, 2005 IEEE International Conference on
  • Conference_Location
    Lincoln, NE
  • Print_ISBN
    0-7803-9232-9
  • Type
    conf
  • DOI
    10.1109/EIT.2005.1627006
  • Filename
    1627006
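
The abstract above describes a signature built from a Markov model over nucleotide triplets and a distance between signatures, but this record does not give the model's details. The following is only a minimal sketch of one possible reading, not the authors' method. It assumes a first-order chain in which each triplet predicts the triplet immediately following it, with transitions counted in every reading frame, and it assumes a plain sum of absolute differences as the distance. The function names `triplet_signature` and `signature_distance` are hypothetical.

```python
from collections import Counter

VALID = set("ACGT")


def triplet_signature(seq):
    """Estimate P(next triplet | current triplet) from a DNA string.

    Assumption: transitions are counted at every offset (all reading
    frames); windows containing characters other than A, C, G, T are
    skipped.
    """
    seq = seq.upper()
    pair_counts = Counter()
    row_totals = Counter()
    for i in range(len(seq) - 5):
        cur, nxt = seq[i:i + 3], seq[i + 3:i + 6]
        if set(cur) <= VALID and set(nxt) <= VALID:
            pair_counts[(cur, nxt)] += 1
            row_totals[cur] += 1
    # Normalize each row so that the probabilities out of a triplet sum to 1.
    return {pair: n / row_totals[pair[0]] for pair, n in pair_counts.items()}


def signature_distance(sig_a, sig_b):
    """Sum of absolute differences between two transition tables.

    Assumption: this L1-style measure stands in for the distance
    mentioned in the abstract; transitions absent from one table are
    treated as probability 0.
    """
    keys = set(sig_a) | set(sig_b)
    return sum(abs(sig_a.get(k, 0.0) - sig_b.get(k, 0.0)) for k in keys)


if __name__ == "__main__":
    a = triplet_signature("ACGTACGTACGT")
    b = triplet_signature("AAAAAAAA")
    print(signature_distance(a, a), signature_distance(a, b))
```

A matrix of such pairwise distances could then be passed to a standard tree-building procedure such as neighbor joining. Whether the paper does so is not stated in this record.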