• DocumentCode
    950305
  • Title

    Toward a systematic definition of protein function that scales to the genome level: defining function in terms of interactions

  • Author

    Lan, Ning ; Jansen, Ronald ; Gerstein, Mark

  • Author_Institution
    Dept. of Molecular Biophys. & Biochem., Yale Univ., New Haven, CT, USA
  • Volume
    90
  • Issue
    12
  • fYear
    2002
  • fDate
    12/1/2002 12:00:00 AM
  • Firstpage
    1848
  • Lastpage
    1858
  • Abstract
    The ultimate goal of functional genomics is to elucidate the function of all the genes in the genome. However the current notions of function are crafted for individual proteins. The degree to which they can scale to the genomic level is not clear In this paper we review the diverse approaches to functional classification, focusing on their ability to meet this challenge of scale. Our review emphasizes a number of key parameters of the systems: their accuracy, comprehensiveness, level of standardization, flexibility, and support for data mining. We then propose an approach that synthesizes a number of the promising features of the existing systems. Our approach, which we call a function grid, is based on the notion of defining a protein´s function through molecular interactions-specifically, in terms of its probability of interaction with various ligands, the list of which can be expanded infinitely. To illustrate how our function grid can be used in genome-wide prediction of function, we construct a grid of yeast genes; combine it with other genomic information, including sequence features, structure, subcellular localization, and messenger ribonucleic acid expression; and then use decision trees and support vector machines to predict deoxyribonucleic acid binding.
  • Keywords
    decision trees; genetics; learning automata; proteins; reviews; deoxyribonucleic acid binding prediction; gene interactions; genome-wide function prediction; ligands interaction probabilities; messenger ribonucleic acid expression; molecular interactions; ontology; protein function; proteome; scaling to genome level; subcellular localization; support vector machines; system key parameters; systematic definition; yeast genes grid; Bioinformatics; Cells (biology); DNA; Fungi; Genomics; Molecular biophysics; Organisms; Proteins; RNA; Sequences;
  • fLanguage
    English
  • Journal_Title
    Proceedings of the IEEE
  • Publisher
    ieee
  • ISSN
    0018-9219
  • Type

    jour

  • DOI
    10.1109/JPROC.2002.805302
  • Filename
    1058229