Title :
Identifying essential proteins based on protein domains in protein-protein interaction networks
Author :
Jianxin Wang ; Wei Peng ; Yingjiao Chen ; Yu Lu ; Yi Pan
Author_Institution :
Sch. of Inf. Sci. & Eng., Central South Univ., Changsha, China
Abstract :
Prediction of essential proteins which are crucial to an organism survival is important for disease analysis and drug design, as well as the understanding of cellular life. The majority of prediction methods infer the possibility of proteins to be essential by using the network topology. However, these methods are limited to the complementation of available protein-protein interaction (PPI) data and depend on the network accuracy. To overcome these limitation, some computational methods have been proposed while seldom of them solve this problem by taking consideration of protein domains. In this work, we firstly analyze the correlation between the essentiality of proteins and their domain features based on data of 13 species. We find that the proteins containing more protein domain types which rarely occur in other proteins tend to be essential. Accordingly we propose a new prediction method, named UDoNC, by combining the domain features of proteins with their topological properties in PPI network. In UDoNC, the essentiality of proteins is decided by the number and the frequency of their protein domain types, as well as the essentiality of their adjacent edges measured by edge clustering coefficient. The experimental results on S. cerevisiae data show that UDoNC outperforms other existing methods in terms of area under the curve (AUC).
Keywords :
bioinformatics; cellular biophysics; diseases; drugs; network topology; pattern clustering; proteins; proteomics; AUC; PPI network; S. cerevisiae data; UDoNC; adjacent edges; area under the curve; cellular life; computational method; disease analysis; drug design; edge clustering coefficient; essential protein identification; network accuracy; network topology; organism survival; prediction method; protein domain features; protein domain type frequency; protein domain type number; protein essentiality; protein-protein interaction data; protein-protein interaction networks; topological properties; Essential proteins; protein domain; protein-protein interaction networks;
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2013 IEEE International Conference on
Conference_Location :
Shanghai
DOI :
10.1109/BIBM.2013.6732476