DocumentCode :
2526962
Title :
Predicting functional gene-links from phylogenetic-statistical analyses of whole genomes
Author :
Barker, Daniel ; Pagel, Mark
Author_Institution :
Sch. of Animal & Microbial Sci., Reading Univ., UK
fYear :
2005
fDate :
8-11 Aug. 2005
Firstpage :
51
Lastpage :
52
Abstract :
We describe a novel application of computational phylogenetic approaches to predict functional linkage among proteins, using proteomes derived from whole genome sequence data. The methods detect independent instances of the correlated gain or loss of pairs of genes on branches of a phylogenetic tree, on the assumption that functionally linked genes are often gained and lost at approximately the same time during evolution. According to this view, several correlated gain and/or loss events between a pair of genes suggest that the gene products are functionally linked. We implement this approach using Dollo parsimony and maximum likelihood (ML) to seek correlated evolution among 21 eukaryotic species. We compare these approaches to each other and to the existing method of phylogenetic profiles, which seeks an across-species correlation but does not explicitly incorporate a phylogenetic tree. We assess all methods according to a positive test set of functionally linked protein pairs based on the MIPS catalogue of yeast protein complexes, and a negative test set of random protein pairs. Both Dollo parsimony and ML are able to achieve far greater specificity than the existing method of phylogenetic profiles. We show that ML is by far the best approach, provided that an appropriate model is used. Best results are obtained if the rate of gain of genes is fixed at a low value, to prevent modeling of multiple gains. With such a model, proteins with strong ML evidence of correlated evolution among eukaryotes are almost certainly functionally linked.
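The contrast drawn in the abstract between tree-aware event counting and plain phylogenetic profiles can be illustrated with a small Python sketch. This is not the authors' implementation: the toy five-species tree, the function names (`leaves`, `dollo_events`, `shared_events`, `profile_matches`) and the scoring by a raw count of shared branches are all assumptions made for illustration, and the maximum likelihood method is not covered. Under Dollo parsimony a gene is assumed to be gained once, on the branch leading to the most recent common ancestor of the species that carry it, and then only lost.

```python
# Toy rooted tree as nested tuples; leaves are species names (hypothetical data).
TREE = ((("A", "B"), "C"), ("D", "E"))
SPECIES = ["A", "B", "C", "D", "E"]


def leaves(node):
    """Return the frozenset of species names at or below a node."""
    if isinstance(node, str):
        return frozenset([node])
    return frozenset().union(*(leaves(child) for child in node))


def dollo_events(tree, present):
    """Return (gain_branch, loss_branches) for one gene under Dollo parsimony.

    A branch is identified by the set of leaves below its child node.
    """
    present = frozenset(present)
    if not present:
        return None, set()
    # Descend to the most recent common ancestor of all species with the gene.
    node = tree
    while not isinstance(node, str):
        containing = [c for c in node if present <= leaves(c)]
        if not containing:
            break
        node = containing[0]
    gain = leaves(node)
    # Below the gain, a loss sits on each branch leading to a gene-free subtree.
    losses = set()
    stack = [node]
    while stack:
        current = stack.pop()
        if isinstance(current, str):
            continue
        for child in current:
            below = leaves(child)
            if below & present:
                stack.append(child)
            else:
                losses.add(below)
    return gain, losses


def shared_events(tree, present_a, present_b):
    """Count branches on which both genes are gained, or both are lost."""
    gain_a, loss_a = dollo_events(tree, present_a)
    gain_b, loss_b = dollo_events(tree, present_b)
    shared_gain = int(gain_a is not None and gain_a == gain_b)
    return shared_gain + len(loss_a & loss_b)


def profile_matches(species, present_a, present_b):
    """Tree-free comparison: number of species where the two genes agree."""
    return sum((s in present_a) == (s in present_b) for s in species)


if __name__ == "__main__":
    gene_x = {"A", "D"}
    gene_y = {"A", "D", "E"}
    print(shared_events(TREE, gene_x, gene_y))
    print(profile_matches(SPECIES, gene_x, gene_y))
```

A raw count like this has no significance test attached; the record above describes assessing the methods against positive and negative test sets, which this sketch does not attempt.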
Keywords :
biochemistry; biology computing; cellular biophysics; evolution (biological); evolutionary computation; genetics; maximum likelihood estimation; molecular biophysics; proteins; Dollo parsimony; MIPS catalogue; computational phylogenetic approach; correlated gain event; correlated loss events; eukaryotic species; evolution; gene-links; genome sequence data; maximum likelihood method; negative test set; phylogenetic tree; phylogenetic-statistical analysis; proteins; proteomes; yeast protein complex; Animals; Bioinformatics; Computer applications; Couplings; Fungi; Genomics; Maximum likelihood detection; Phylogeny; Proteins; Testing;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Computational Systems Bioinformatics Conference, 2005. Workshops and Poster Abstracts. IEEE
Print_ISBN :
0-7695-2442-7
Type :
conf
DOI :
10.1109/CSBW.2005.110
Filename :
1540536