Title :
Whole genome phylogeny based on clustered signature string composition
Author :
Wu, Xiaomeng ; Lin, Guohui ; Wan, Xiu-Feng ; Xu, Dong
Author_Institution :
Dept. of Comput. Sci., Alberta Univ., Edmonton, Alta., Canada
Abstract :
Peptide compositions constructed from whole sets of protein sequences can be used as species signatures for phylogenetic analysis. To account for point mutations, an amino acid substitution model is integrated into the complete composition vectors through a novel peptide clustering algorithm. Such a refined signature is expected to highlight deeper evolutionary relationships among the species, and it is employed in whole genome phylogenetic analysis to define a new evolutionary distance measure. Computational experiments have been set up to validate the effectiveness of this new measure, and a vertebrate evolutionary tree built from a dataset of 832 proteins for 64 vertebrates is reported.
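As a rough illustration of the composition-vector idea in the abstract, the sketch below counts fixed-length peptides across a species' protein set and compares two species by a cosine-based distance. It is not the authors' method: the peptide clustering step and the amino acid substitution model are omitted, and the function names, the peptide length k=3, and the choice of cosine distance are all assumptions made for illustration.

```python
from collections import Counter
from math import sqrt


def peptide_composition(proteins, k=3):
    """Relative frequency of each length-k peptide over a set of protein sequences.

    Hypothetical helper: plain frequencies only, with no clustering and no
    substitution model applied.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width k along the sequence.
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pep: n / total for pep, n in counts.items()}


def composition_distance(a, b):
    """Assumed distance: 1 minus the cosine similarity of two composition vectors."""
    dot = sum(freq * b.get(pep, 0.0) for pep, freq in a.items())
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 1.0  # treat an empty signature as maximally distant
    return 1.0 - dot / (norm_a * norm_b)


# Toy usage with made-up sequences (not data from the paper).
species_x = peptide_composition(["MKVLAAGIV", "MKVLSAGIV"])
species_y = peptide_composition(["MKVLAAGLV"])
print(composition_distance(species_x, species_y))
```

A pairwise matrix of such distances over all species could then be passed to a standard distance-based tree-building method such as neighbor joining.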
Keywords :
biochemistry; biology computing; evolution (biological); genetics; molecular biophysics; pattern clustering; proteins; statistical analysis; amino acid substitution model; clustered signature string composition; composition vectors; evolutionary distance measure; genome phylogeny; peptide clustering algorithm; peptide composition; phylogenetic analysis; point mutation; protein sequence; vertebrate evolutionary tree; Amino acids; Bioinformatics; Frequency; Genetic mutations; Genomics; Information analysis; Peptides; Phylogeny; Proteins; Sequences;
Conference_Title :
Computational Systems Bioinformatics Conference, 2005. Workshops and Poster Abstracts. IEEE
Print_ISBN :
0-7695-2442-7
DOI :
10.1109/CSBW.2005.143