• DocumentCode
    3109598
  • Title

    A Network-Based Approach for Protein Functions Prediction Using Locally Linear Embedding

  • Author

    Zhao, Haifeng ; Sun, Dengdi ; Wang, Rifeng ; Luo, Bin

  • Author_Institution
    Key Lab. of Intell. Comput. & Signal Process., Anhui Univ., Hefei, China
  • fYear
    2010
  • fDate
    18-20 June 2010
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Inferring protein functions from different data sources is a challenging task in the post-genomic era, as a large number of crude protein structures from structural genomics project are now solved without their biochemical functions characterized. Recently, many different methods have been used to predict protein functions including those based on Protein-Protein Interaction (PPI), structure, sequence relationship, gene expression data, etc. Among these approaches, methods based on protein interaction data are very promising. In this paper, we studied a network-based method using locally linear embedding (LLE). LLE is a robust learning algorithm that manipulates dimensionality reduction, neighborhood-preserving embedding for high-dimensional data. We first embed both annotated and unannotated proteins in a low dimensional Euclidean space; then, we apply semi-supervised learning techniques to classify unannotated proteins into different functional groups. Finally, we made predictions to the unknown functional proteins in yeast. 5-fold cross validation is then applied to the GO terms to compare the performance of different approaches, and the proposed method performs significantly better than the others.
  • Keywords
    biology computing; genomics; learning (artificial intelligence); molecular biophysics; proteins; 5-fold cross validation; biochemical functions; gene expression; locally linear embedding; low dimensional Euclidean space; protein functions prediction; protein structures; protein-protein interaction; semisupervised learning; sequence relationship; structural genomics; yeast; Bioinformatics; Computer networks; Gene expression; Genomics; Intelligent networks; Intelligent structures; Learning systems; Protein sequence; Semisupervised learning; Sun;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedical Engineering (iCBBE), 2010 4th International Conference on
  • Conference_Location
    Chengdu
  • ISSN
    2151-7614
  • Print_ISBN
    978-1-4244-4712-1
  • Electronic_ISBN
    2151-7614
  • Type

    conf

  • DOI
    10.1109/ICBBE.2010.5515908
  • Filename
    5515908