DocumentCode
1855525
Title
A genome signature based on Markov modeling
Author
Li, Jian ; Sayood, Khalid
Author_Institution
Dept. of Electr. Eng., Nebraska Univ., Lincoln, NE
fYear
2005
fDate
22-25 May 2005
Lastpage
6
Abstract
We propose a "genome signature" for bacterial genomes based on a triplets Markov model. Without the alignment or data preprocessing required by traditional analysis methods, the model is shown to efficiently capture identifying genomic information at both species and strain levels. Based on the model, a simple assumption-free distance measure is proposed for constructing phylogeny trees. The approach avoids problems with word frequency approaches such as balancing word length and window size. The method is shown to work successfully with both bacterial whole genome data and individual eukaryotic genes. Application of the model to phylogenetic analysis is presented
Keywords
Markov processes; genetics; microorganisms; trees (mathematics); bacterial genomes; bacterial whole genome data; distance measure; eukaryotic genes; genome signature; genomic information; phylogenetic analysis; phylogeny trees; triplets Markov model; Bioinformatics; Capacitive sensors; DNA; Frequency; Genomics; Microorganisms; Organisms; Phylogeny; Proteins; Sequences;
fLanguage
English
Publisher
ieee
Conference_Titel
Electro Information Technology, 2005 IEEE International Conference on
Conference_Location
Lincoln, NE
Print_ISBN
0-7803-9232-9
Type
conf
DOI
10.1109/EIT.2005.1627006
Filename
1627006
Link To Document