DocumentCode :
1855525
Title :
A genome signature based on Markov modeling
Author :
Li, Jian ; Sayood, Khalid
Author_Institution :
Dept. of Electr. Eng., Nebraska Univ., Lincoln, NE
fYear :
2005
fDate :
22-25 May 2005
Lastpage :
6
Abstract :
We propose a "genome signature" for bacterial genomes based on a triplets Markov model. Without the alignment or data preprocessing required by traditional analysis methods, the model is shown to efficiently capture identifying genomic information at both species and strain levels. Based on the model, a simple assumption-free distance measure is proposed for constructing phylogeny trees. The approach avoids problems with word frequency approaches such as balancing word length and window size. The method is shown to work successfully with both bacterial whole genome data and individual eukaryotic genes. Application of the model to phylogenetic analysis is presented.
Keywords :
Markov processes; genetics; microorganisms; trees (mathematics); bacterial genomes; bacterial whole genome data; distance measure; eukaryotic genes; genome signature; genomic information; phylogenetic analysis; phylogeny trees; triplets Markov model; Bioinformatics; Capacitive sensors; DNA; Frequency; Genomics; Microorganisms; Organisms; Phylogeny; Proteins; Sequences;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Electro Information Technology, 2005 IEEE International Conference on
Conference_Location :
Lincoln, NE
Print_ISBN :
0-7803-9232-9
Type :
conf
DOI :
10.1109/EIT.2005.1627006
Filename :
1627006