Title :
Selection of features from protein-protein interaction network for identifying cancer genes
Author :
Li, Yongjin ; Patra, Jagdish C.
Author_Institution :
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore
Abstract :
Cancer is a group of complex diseases, in which a relatively large number of genes are involved. One of the main goals of cancer research is to identify genes that causally relevant to the development and progress of cancer. The increasingly identified cancer genes and availability of genomic and proteomics data provide us opportunities to identify cancer genes by computational methods. In this work, we investigated five predictive topological features, derived from the protein-protein interaction networks, in identifying cancer genes. We used 10-fold cross validation to assess the predictive ability of all the combinations of these features and found the most predictive feature and feature combinations. Two kinds of neural networks, support vector machine (SVM) and multi-layer perceptrons (MLP), were employed to assess the predictive ability of features. We found that the best feature combination for these two algorithms is the same. At the same time, we found SVM performs slightly better than MLP. Using only 2 or 3 features, the best performance of our classification model can get accuracy as high as 73.9%.
Keywords :
biology computing; cancer; genetics; neural nets; support vector machines; cancer genes; cancer research; complex diseases; genomic data; multilayer perceptrons; neural networks; predictive ability; predictive topological features; protein-protein interaction network; proteomics data; support vector machine; Bioinformatics; Cancer; Diseases; Genomics; Multi-layer neural network; Multilayer perceptrons; Neural networks; Proteins; Proteomics; Support vector machines; Cancer Genes; PPI Networks; Topological Features;
Conference_Titel :
Systems, Man and Cybernetics, 2008. SMC 2008. IEEE International Conference on
Conference_Location :
Singapore
Print_ISBN :
978-1-4244-2383-5
Electronic_ISBN :
1062-922X
DOI :
10.1109/ICSMC.2008.4811534