DocumentCode
478508
Title
A Novel Measurement of Sequence Dissimilarity and Its Application to Phylogeny
Author
Niu, Xiaohui ; Li, Nana ; Shi, Feng ; Li, Xueyan
Author_Institution
Coll. of Sci., Huazhong Agric. Univ., Wuhan
Volume
6
fYear
2008
fDate
18-20 Oct. 2008
Firstpage
231
Lastpage
234
Abstract
We present a new computational approach to measure the distance between two biological sequences. A biological sequence quantifies as a Markov Chain with 20 states. Stochastic state transition matrix is computed as the quantitative index of the biological sequence. The Kullback-Leibler discrimination information is used as a diversity indicator to measure the dissimilarity of each pair of the rows in the two state transition matrix. Distance between the two sequences is defined as the average value with the weight of the occurrence possibility of each amino acid. We illustrate its application in reconstructing a phylogeny of the Eutherian orders using concatenated H-stranded amino acid sequences. This phylogeny is consistent with the commonly accepted one for the Eutherians.
Keywords
Markov processes; biology computing; matrix algebra; proteins; H-stranded amino acid sequences; Markov chain; biological sequences; diversity indicator; phylogeny; sequence dissimilarity; stochastic state transition matrix; two state transition matrix; Agriculture; Amino acids; Biology computing; Concatenated codes; Educational institutions; Frequency; Phylogeny; Protein sequence; Statistical distributions; Stochastic processes; Kullback-Leibler discrimination information; Measurement of Sequence Dissimilarity; Phylogeny Tree;
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Computation, 2008. ICNC '08. Fourth International Conference on
Conference_Location
Jinan
Print_ISBN
978-0-7695-3304-9
Type
conf
DOI
10.1109/ICNC.2008.299
Filename
4667835
Link To Document