DocumentCode :
3265286
Title :
Fuzzy Profile Hidden Markov Models for Protein Sequence Analysis
Author :
Bidargaddi, N.P. ; Chetty, M. ; Kamruzzaman, J.
Author_Institution :
Gippsland School of Computing and Information Technology Faculty of Information Technology, Monash University Gippsland Campus, Churchill, VIC-3842, Australia, niranjan.bidargaddi@infotech.monash.edu.au
fYear :
2005
fDate :
14-15 Nov. 2005
Firstpage :
1
Lastpage :
8
Abstract :
Profile HMMs based on classical hidden Markov models have been widely applied for alignment and classification of protein sequence families. The formulation of the forward and backward variables in profile HMMs is made under statistical independence assumption of the probability theory. We propose a fuzzy profile hidden Markov model to overcome the limitations of the statistical independence assumption of probability theory. The strong correlations and the sequence preference involved in the protein structures make fuzzy architecture based models as suitable candidates for building profiles of a given family since fuzzy set can handle uncertainties better than classical methods. The proposed model fuzzifies the forward and backward variables by incorporating Sugeno fuzzy measures using Choquet integrals which is extended to fuzzy Baum-Welch parameter estimation algorithm for profiles. It was built and tested on widely studied globin and kinase family sequences and its performance was compared with classical HMM. A comparative analysis based on Log-Likelihood (LL) scores of sequences and Receiver Operating Characteristic (ROC) demonstrates the superiority of fuzzy profile HMMs over the classical profile model.
Keywords :
Amino acids; Australia; Databases; Fuzzy sets; Hidden Markov models; Information analysis; Information technology; Parameter estimation; Probability; Protein sequence;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence in Bioinformatics and Computational Biology, 2005. CIBCB '05. Proceedings of the 2005 IEEE Symposium on
Print_ISBN :
0-7803-9387-2
Type :
conf
DOI :
10.1109/CIBCB.2005.1594950
Filename :
1594950
Link To Document :
بازگشت