• DocumentCode
    464297
  • Title

    Prediction of Enzyme Catalytic Sites from Sequence Using Neural Networks

  • Author

    Pande, Swati ; Raheja, Amar ; Livesay, Dennis R.

  • Author_Institution
    Dept. of Biol. Sci., California State Polytech. Univ., Pomona, CA
  • fYear
    2007
  • fDate
    1-5 April 2007
  • Firstpage
    247
  • Lastpage
    253
  • Abstract
    The accurate prediction of enzyme catalytic sites remains an open problem in bioinformatics. Recently, several structure-based methods have become popular; however, few robust sequence-only methods have been developed. In this report, we demonstrate that three different feed forward neural networks, trained on a variety of sequence-based properties, can reliably predict enzyme catalytic sites. To the best of our knowledge, this is only the second report using neural networks to predict catalytic sites, and is the first relying solely on sequence-derived information. Scaled conjugate gradient is used during training of the models. The simplest of the models uses only sequence conservation, diversity of position and residue identity within the input. Surprisingly, model accuracy is largely unaffected when sequence-based predictions of structural properties (i.e. solvent accessibility and secondary structure) are added to the input. A similar lack of improvement is observed when evolutionary information in the form of phylogenetic motifs is included. These results are noteworthy because they indicate that routine neural network architectures can accurately predict catalytic using only residue identity and conservation inputs. However, applying these methods on a per protein basis still produces a significant number of false positives, which significantly reduces the model´s utility to experimentalists.
  • Keywords
    biology computing; enzymes; feedforward neural nets; bioinformatics; enzyme catalytic sites; feedforward neural networks; phylogenetic motifs; scaled conjugate gradient; Biochemistry; Bioinformatics; Feedforward neural networks; Feeds; Neural networks; Phylogeny; Predictive models; Proteins; Robustness; Solvents;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Bioinformatics and Computational Biology, 2007. CIBCB '07. IEEE Symposium on
  • Conference_Location
    Honolulu, HI
  • Print_ISBN
    1-4244-0710-9
  • Type

    conf

  • DOI
    10.1109/CIBCB.2007.4221230
  • Filename
    4221230