Title :
A Novel Measurement of Sequence Dissimilarity and Its Application to Phylogeny
Author :
Niu, Xiaohui ; Li, Nana ; Shi, Feng ; Li, Xueyan
Author_Institution :
Coll. of Sci., Huazhong Agric. Univ., Wuhan
Abstract :
We present a new computational approach to measure the distance between two biological sequences. A biological sequence quantifies as a Markov Chain with 20 states. Stochastic state transition matrix is computed as the quantitative index of the biological sequence. The Kullback-Leibler discrimination information is used as a diversity indicator to measure the dissimilarity of each pair of the rows in the two state transition matrix. Distance between the two sequences is defined as the average value with the weight of the occurrence possibility of each amino acid. We illustrate its application in reconstructing a phylogeny of the Eutherian orders using concatenated H-stranded amino acid sequences. This phylogeny is consistent with the commonly accepted one for the Eutherians.
Keywords :
Markov processes; biology computing; matrix algebra; proteins; H-stranded amino acid sequences; Markov chain; biological sequences; diversity indicator; phylogeny; sequence dissimilarity; stochastic state transition matrix; two state transition matrix; Agriculture; Amino acids; Biology computing; Concatenated codes; Educational institutions; Frequency; Phylogeny; Protein sequence; Statistical distributions; Stochastic processes; Kullback-Leibler discrimination information; Measurement of Sequence Dissimilarity; Phylogeny Tree;
Conference_Titel :
Natural Computation, 2008. ICNC '08. Fourth International Conference on
Conference_Location :
Jinan
Print_ISBN :
978-0-7695-3304-9
DOI :
10.1109/ICNC.2008.299