DocumentCode
3109598
Title
A Network-Based Approach for Protein Functions Prediction Using Locally Linear Embedding
Author
Zhao, Haifeng ; Sun, Dengdi ; Wang, Rifeng ; Luo, Bin
Author_Institution
Key Lab. of Intell. Comput. & Signal Process., Anhui Univ., Hefei, China
fYear
2010
fDate
18-20 June 2010
Firstpage
1
Lastpage
4
Abstract
Inferring protein functions from different data sources is a challenging task in the post-genomic era, as a large number of crude protein structures from structural genomics project are now solved without their biochemical functions characterized. Recently, many different methods have been used to predict protein functions including those based on Protein-Protein Interaction (PPI), structure, sequence relationship, gene expression data, etc. Among these approaches, methods based on protein interaction data are very promising. In this paper, we studied a network-based method using locally linear embedding (LLE). LLE is a robust learning algorithm that manipulates dimensionality reduction, neighborhood-preserving embedding for high-dimensional data. We first embed both annotated and unannotated proteins in a low dimensional Euclidean space; then, we apply semi-supervised learning techniques to classify unannotated proteins into different functional groups. Finally, we made predictions to the unknown functional proteins in yeast. 5-fold cross validation is then applied to the GO terms to compare the performance of different approaches, and the proposed method performs significantly better than the others.
Keywords
biology computing; genomics; learning (artificial intelligence); molecular biophysics; proteins; 5-fold cross validation; biochemical functions; gene expression; locally linear embedding; low dimensional Euclidean space; protein functions prediction; protein structures; protein-protein interaction; semisupervised learning; sequence relationship; structural genomics; yeast; Bioinformatics; Computer networks; Gene expression; Genomics; Intelligent networks; Intelligent structures; Learning systems; Protein sequence; Semisupervised learning; Sun;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Biomedical Engineering (iCBBE), 2010 4th International Conference on
Conference_Location
Chengdu
ISSN
2151-7614
Print_ISBN
978-1-4244-4712-1
Electronic_ISBN
2151-7614
Type
conf
DOI
10.1109/ICBBE.2010.5515908
Filename
5515908
Link To Document