DocumentCode :
2564655
Title :
Protein family classification using structural and sequence information
Author :
Smith, Scott F.
Author_Institution :
Dept. of Electr. & Comput. Eng., Boise State Univ., ID, USA
fYear :
2004
fDate :
7-8 Oct. 2004
Firstpage :
168
Lastpage :
174
Abstract :
Protein family classification usually relies either on sequence information (as in hidden Markov models and position-specific scoring matrices) or on structural information, where some form of average positional error between atomic locations is used. The positional error method requires that the structures of all the proteins to be classified be known. Sequence methods have the advantage that a much larger number of proteins can be classified, since far more sequences are known than structures. However, sequence methods discard a large amount of useful information contained in the structures of the subset of proteins in the family for which structures are known. A protein family classification system is presented that uses both structural and sequence information and combines this information in a way consistent with fuzzy systems theory. The nonlinear fuzzy-theory-based method is found to perform better than either an equally weighted linear combination of the sequence and structural information or the sequence information alone.
Keywords :
biology computing; fuzzy systems; hidden Markov models; molecular biophysics; proteins; biological sequence analysis; computational molecular biology; fuzzy systems theory; hidden Markov model; nonlinear fuzzy-theory-based method; position-specific scoring matrix; protein family classification; sequence information; structural information; Amino acids; Biology computing; Computational systems biology; Computer architecture; Databases; Fuzzy systems; Hidden Markov models; Protein engineering; Protein sequence; Topology;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computational Intelligence in Bioinformatics and Computational Biology, 2004. CIBCB '04. Proceedings of the 2004 IEEE Symposium on
Print_ISBN :
0-7803-8728-7
Type :
conf
DOI :
10.1109/CIBCB.2004.1393950
Filename :
1393950