DocumentCode :
3103541
Title :
Application of Function Domain and Pseudo Amino Acid Composition to Predict Hetero-Oligomer Protein Structural Types
Author :
Xiao, Xuan ; Wang, Pu
Author_Institution :
Sch. of Mech. & Electron. Eng., Jing-De-Zhen Ceramic Inst., Jing-De-Zhen, China
fYear :
2010
fDate :
18-20 June 2010
Firstpage :
1
Lastpage :
4
Abstract :
With the avalanche of protein sequences generated in the post-genomic age, it is highly desirable to develop an automated method by which crystallographic scientists can rapidly and effectively identify which quaternary attribute a particular protein chain has according to its sequence information. Given most of the previous studies are limited to homo-oligomers, in this paper, we will try to identify the quaternary attribute of hetero-oligomer proteins. For a hetero-oligomer, its type will be identified among the following six categories: (1) heterodimer, (2) heterotrimer, (3) heterotetramer, (4) heteropentamer, (5) heterohexamer, (6) heterooctamer. Using machine learning approach, the Fuzzy Nearest Neighbor Algorithm (FKNN), we developed a prediction system for protein quaternary structural type in which we incorporated functional domain composition (FunD) and pseudo-amino acid composition (PseAA). The overall accuracy achieved by this system is more than 80% in the Jack-knife test. Such a technique should improve the success rate of structural biology projects.
Keywords :
biological techniques; biology computing; fuzzy logic; genomics; learning (artificial intelligence); molecular biophysics; molecular configurations; proteins; Jack-knife test; functional domain composition; fuzzy nearest neighbor algorithm; hetero-oligomer protein structural types; heterodimer; heterohexamer; heterooctamer; heteropentamer; heterotetramer; heterotrimer; machine learning approach; post-genomic age; protein chain; protein quaternary structural type; protein sequences; pseudo amino acid composition; structural biology projects; Amino acids; Ceramics; Crystallography; Fuzzy systems; In vivo; Machine learning; Machine learning algorithms; Nearest neighbor searches; Protein engineering; Sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedical Engineering (iCBBE), 2010 4th International Conference on
Conference_Location :
Chengdu
ISSN :
2151-7614
Print_ISBN :
978-1-4244-4712-1
Electronic_ISBN :
2151-7614
Type :
conf
DOI :
10.1109/ICBBE.2010.5515624
Filename :
5515624
Link To Document :
بازگشت