• DocumentCode
    394147
  • Title

    Protein sequences classification using radial basis function (RBF) neural networks

  • Author

    Wang, Dianhui ; Lee, Wung Kion ; Dillon, Tharam S. ; Hoogenraad, N.J.

  • Author_Institution
    Dept. of Comput. Sci. & Comput. Eng., La Trobe Univ., Melbourne, Vic., Australia
  • Volume
    2
  • fYear
    2002
  • fDate
    18-22 Nov. 2002
  • Firstpage
    764
  • Abstract
    A protein super-family consists of proteins which share amino acid sequence homology and which may therefore be functionally and structurally related. Traditionally, two protein sequences are classified into the same class if they have high homology in terms of feature patterns extracted through sequence alignment algorithms. As the sizes of the protein sequence databases are very large, it is a very time consuming job to perform exhaustive comparison of existing protein sequence. Therefore, there is a need to build an improved classification system for effectively identifying protein sequences. This paper presents a modular neural classifier for protein sequences with improved classification criteria. The intelligent classification techniques described in the paper aims to enhance the performance of single neural classifiers based on a centralized information structure in terms of recognition rate generalization and reliability. The architecture of the proposed model is a modular RBF neural network with a compensational combination at the transition output layer. The connection weights between the final output layer and the transition output layer are optimized by delta rule, which serve as an integrator of the local neural classifiers. To enhance the classification reliability, we present two heuristic rules to apply to decisionmaking. Two sets of protein sequences with ten classes of superfamilies downloaded from a public domain database, Protein Information Resources (PIR), are used in our simulation study. Experimental results with performance comparisons are carried out between single neural classifiers and the proposed modular neural classifier.
  • Keywords
    biology computing; pattern classification; proteins; radial basis function networks; Protein Information Resources; amino acid sequence homology; centralized information structure; classification reliability; classification system; compensational combination; connection weights; delta rule; feature pattern extraction; final output layer; heuristic rules; intelligent classification techniques; local neural classifiers; modular RBF neural network; modular neural classifier; protein sequence classification; protein sequence databases; protein super-family; public domain database; radial basis function neural networks; sequence alignment algorithms; single neural classifiers; transition output layer; Classification tree analysis; Cognitive science; Computer science; Decision trees; Feature extraction; Humans; Neural networks; Protein engineering; Protein sequence; Spatial databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Information Processing, 2002. ICONIP '02. Proceedings of the 9th International Conference on
  • Print_ISBN
    981-04-7524-1
  • Type

    conf

  • DOI
    10.1109/ICONIP.2002.1198161
  • Filename
    1198161