DocumentCode :
3109598
Title :
A Network-Based Approach for Protein Functions Prediction Using Locally Linear Embedding
Author :
Zhao, Haifeng ; Sun, Dengdi ; Wang, Rifeng ; Luo, Bin
Author_Institution :
Key Lab. of Intell. Comput. & Signal Process., Anhui Univ., Hefei, China
fYear :
2010
fDate :
18-20 June 2010
Firstpage :
1
Lastpage :
4
Abstract :
Inferring protein functions from different data sources is a challenging task in the post-genomic era, as a large number of crude protein structures from structural genomics project are now solved without their biochemical functions characterized. Recently, many different methods have been used to predict protein functions including those based on Protein-Protein Interaction (PPI), structure, sequence relationship, gene expression data, etc. Among these approaches, methods based on protein interaction data are very promising. In this paper, we studied a network-based method using locally linear embedding (LLE). LLE is a robust learning algorithm that manipulates dimensionality reduction, neighborhood-preserving embedding for high-dimensional data. We first embed both annotated and unannotated proteins in a low dimensional Euclidean space; then, we apply semi-supervised learning techniques to classify unannotated proteins into different functional groups. Finally, we made predictions to the unknown functional proteins in yeast. 5-fold cross validation is then applied to the GO terms to compare the performance of different approaches, and the proposed method performs significantly better than the others.
Keywords :
biology computing; genomics; learning (artificial intelligence); molecular biophysics; proteins; 5-fold cross validation; biochemical functions; gene expression; locally linear embedding; low dimensional Euclidean space; protein functions prediction; protein structures; protein-protein interaction; semisupervised learning; sequence relationship; structural genomics; yeast; Bioinformatics; Computer networks; Gene expression; Genomics; Intelligent networks; Intelligent structures; Learning systems; Protein sequence; Semisupervised learning; Sun;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedical Engineering (iCBBE), 2010 4th International Conference on
Conference_Location :
Chengdu
ISSN :
2151-7614
Print_ISBN :
978-1-4244-4712-1
Electronic_ISBN :
2151-7614
Type :
conf
DOI :
10.1109/ICBBE.2010.5515908
Filename :
5515908
Link To Document :
بازگشت