Title :
IUP: Intrinsically Unstructured Protein predictor - A software tool for analyzing polypeptide sequences
Author :
Yang, Mary Qu ; Yang, Jack Y.
Author_Institution :
Dept. of Health & Human Services, Nat. Inst. of Health, Bethesda, MD
Abstract :
Many protein regions and some entire proteins have no definite tertiary structure, presenting instead as dynamic, disorder ensembles under different physiochemical circumstances. These proteins and regions are known as intrinsically unstructured proteins (IUP). IUP have been associated with a wide range of protein functions and play essential roles in diseases characterized by protein misfolding and aggregation. Identifying IUP is an important but difficult task in today´s structural and functional genomics. We exact useful features from polypeptide sequences and develop machine learning algorithms for the above task. We compare our IUP predictor with PONDRreg (mainly neural-network-based predictors), disEMBL (also based on neural networks) and Globplot (based on IUP propensity). We find that augmenting features derived from physiochemical properties of amino acids (such as 9-gram encoding scheme and hydrophobicity) and using ensemble methods proved beneficial. The IUP predictor is a viable alternative software tool for identifying IUP regions and proteins
Keywords :
biochemistry; biological techniques; biology computing; genetics; learning (artificial intelligence); molecular biophysics; molecular configurations; neural nets; proteins; 9-gram encoding scheme; Globplot; IUP predictor; IUP propensity; PONDR; amino acids; disEMBL; ensemble methods; genomics; hydrophobicity; intrinsic unstructured protein predictor; machine learning algorithms; neural-network-based predictors; physiochemical properties; polypeptide sequences; protein aggregation; protein functions; protein misfolding; software tool; Amino acids; Bioinformatics; Diseases; Genomics; Humans; Lifting equipment; Nuclear magnetic resonance; Protein engineering; Software tools; Spectroscopy;
Conference_Titel :
BioInformatics and BioEngineering, 2006. BIBE 2006. Sixth IEEE Symposium on
Conference_Location :
Arlington, VA
Print_ISBN :
0-7695-2727-2
DOI :
10.1109/BIBE.2006.253309