Title :
Predicting Protein-Protein Interaction Based on Fisher Scores Extracted from Domain Profiles
Author :
Patel, Tapan ; Liao, Li
Author_Institution :
Univ. of Delaware Newark, Newark
Abstract :
In this work, we propose a machine learning method to identify protein-protein interacting partners based on domain level knowledge that can take into account information about the interaction sites. The general approach is to use the profile hidden Markov models of protein domains and the known interactions between domains to train a support vector machine. Proteins are characterized by the vectors of fisher scores that are obtained from comparing the protein sequences to the hidden Markov model for a given domain. Protein pairs, represented by concatenation of their respective fisher score vectors, are classified as interacting partners and non interacting partners by a trained SVM. By selecting the fisher scores based on a profile hidden Markov model that differentiates the interaction sites from other residues within the domain, we demonstrated that the prediction accuracy was significantly improved, as measured in a series of cross validation experiments.
Keywords :
hidden Markov models; learning (artificial intelligence); molecular biophysics; proteins; support vector machines; SVM; cross validation; fisher score vector; interacting partners; machine learning; profile hidden Markov models; protein-protein interaction; residues; support vector machine; Accuracy; Biology computing; Data mining; Hidden Markov models; Learning systems; Predictive models; Protein engineering; Sequences; Support vector machine classification; Support vector machines; feacture selelction; fisher scores; profile hidden Markov models; protein-protein interaction; support vector machines;
Conference_Titel :
Bioinformatics and Bioengineering, 2007. BIBE 2007. Proceedings of the 7th IEEE International Conference on
Conference_Location :
Boston, MA
Print_ISBN :
978-1-4244-1509-0
DOI :
10.1109/BIBE.2007.4375672