DocumentCode :
2924614
Title :
Predicting a protein´s melting temperature from its amino acid sequence
Author :
Gorania, Malde ; Seker, Huseyin ; Haris, Parvez I.
Author_Institution :
Dept. of Inf., Bio-Health Inf. Res. Group, De Montfort Univ., Leicester, UK
fYear :
2010
fDate :
Aug. 31 2010-Sept. 4 2010
Firstpage :
1820
Lastpage :
1823
Abstract :
Melting temperature is an important characteristic feature of a protein and is used for various purposes such as in drug development. Currently protein melting temperature is determined by laboratory methods such as Differential Scanning Calorimetry, Circular Dichroism, Fourier transform infrared spectroscopy and several other methods. These methods are laborious and costly. Therefore, we propose a novel bioinformatics based method for predicting protein melting temperature from amino acid sequence of a protein. This is not only a challenging task but has been previously unexplored. For this study, melting temperature of 230 proteins from a range of organisms was collected along with their sequence information from the published literature. The melting temperature of these proteins represents a very large spectrum and varies between 25°C and 113°C. The protein sequences are then used to derive two sets of sequence-driven features, namely amino acid composition (AAC) and pseudo-amino acid composition (PseudoAAC) to characterise the proteins. In order to predict the melting temperature, two different computational intelligence methods, namely artificial neural networks (ANN) and adaptive network-fuzzy inference system (ANFIS) were utilized. Amongst over 100 different models generated, the ANN produced the best model with the least error (0.01087 for the AAC and 0.01086 for the pseudoAAC). As both feature sets yielded quite similar error and computation of pseudoAAC is costly when compared to that of AAC, traditional AAC seems to be an effective feature set for predicting melting temperature. The results obtained in this study are very promising and, for the first time, shows that the melting temperature of a protein can be predicted from its amino acid sequence only. Therefore, costly lab-based experiments may not be required to measure the melting temperature and the bioinformatics models can help speed up laboratory processes such as in drug develop- ent.
Keywords :
bioinformatics; fuzzy logic; inference mechanisms; melting point; molecular biophysics; molecular configurations; neural nets; proteins; ANFIS; adaptive network fuzzy inference system; artificial neural networks; bioinformatics based method; computational intelligence methods; protein amino acid sequence; protein melting temperature prediction; protein sequence information; pseudo-amino acid composition; pseudoAAC; sequence driven features; temperature 25 degC to 113 degC; Amino acids; Artificial neural networks; Computational modeling; Predictive models; Proteins; Stability analysis; Thermal stability; ANFIS; Artificial neural networks; adaptive neuro-fuzzy rule-based systems; amino acid composition and pseudo amino acid composition; Amino Acid Sequence; Computer Simulation; Models, Chemical; Molecular Sequence Data; Protein Denaturation; Proteins; Sequence Analysis, Protein; Structure-Activity Relationship; Transition Temperature;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Engineering in Medicine and Biology Society (EMBC), 2010 Annual International Conference of the IEEE
Conference_Location :
Buenos Aires
ISSN :
1557-170X
Print_ISBN :
978-1-4244-4123-5
Type :
conf
DOI :
10.1109/IEMBS.2010.5626421
Filename :
5626421
Link To Document :
بازگشت