• DocumentCode
    3265286
  • Title

    Fuzzy Profile Hidden Markov Models for Protein Sequence Analysis

  • Author

    Bidargaddi, N.P. ; Chetty, M. ; Kamruzzaman, J.

  • Author_Institution
    Gippsland School of Computing and Information Technology Faculty of Information Technology, Monash University Gippsland Campus, Churchill, VIC-3842, Australia, niranjan.bidargaddi@infotech.monash.edu.au
  • fYear
    2005
  • fDate
    14-15 Nov. 2005
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Profile HMMs based on classical hidden Markov models have been widely applied for alignment and classification of protein sequence families. The formulation of the forward and backward variables in profile HMMs is made under statistical independence assumption of the probability theory. We propose a fuzzy profile hidden Markov model to overcome the limitations of the statistical independence assumption of probability theory. The strong correlations and the sequence preference involved in the protein structures make fuzzy architecture based models as suitable candidates for building profiles of a given family since fuzzy set can handle uncertainties better than classical methods. The proposed model fuzzifies the forward and backward variables by incorporating Sugeno fuzzy measures using Choquet integrals which is extended to fuzzy Baum-Welch parameter estimation algorithm for profiles. It was built and tested on widely studied globin and kinase family sequences and its performance was compared with classical HMM. A comparative analysis based on Log-Likelihood (LL) scores of sequences and Receiver Operating Characteristic (ROC) demonstrates the superiority of fuzzy profile HMMs over the classical profile model.
  • Keywords
    Amino acids; Australia; Databases; Fuzzy sets; Hidden Markov models; Information analysis; Information technology; Parameter estimation; Probability; Protein sequence;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence in Bioinformatics and Computational Biology, 2005. CIBCB '05. Proceedings of the 2005 IEEE Symposium on
  • Print_ISBN
    0-7803-9387-2
  • Type

    conf

  • DOI
    10.1109/CIBCB.2005.1594950
  • Filename
    1594950