DocumentCode :
893440
Title :
A Grid-Enabled Protein Secondary Structure Predictor
Author :
Mirto, Maria ; Cafaro, Massimo ; Fiore, Sandro Luigi ; Tartarini, Daniele ; Aloisio, Giovanni
Author_Institution :
Center for Adv. Computational Technol., Nat. Nanotechnology Lab.
Volume :
6
Issue :
2
fYear :
2007
fDate :
6/1/2007 12:00:00 AM
Firstpage :
124
Lastpage :
130
Abstract :
We present an integrated Grid system for the prediction of protein secondary structures, based on the frequent automatic update of proteins in the training set. The predictor model is based on a feed-forward multilayer perceptron (MLP) neural network which is trained with the back-propagation algorithm; the design reuses existing legacy software and exploits novel grid components. The predictor takes into account the evolutionary information found in multiple sequence alignment (MSA); the information is obtained running an optimized parallel version of the PSI-BLAST tool, based on the MPI Master-Worker paradigm. The training set contains proteins of known structure. Using Grid technologies and efficient mechanisms for running the tools and extracting the data, the time needed to train the neural network is dramatically reduced, whereas the results are comparable to a set of well-known predictor tools.
Keywords :
backpropagation; biology computing; grid computing; molecular biophysics; molecular configurations; multilayer perceptrons; proteins; MPI Master-Worker paradigm; PSI-BLAST tool; back-propagation algorithm; feedforward multilayer perceptron; grid; multiple sequence alignment; neural network; protein secondary structure predictor; Algorithm design and analysis; Data mining; Feedforward neural networks; Feedforward systems; Multi-layer neural network; Multilayer perceptrons; Neural networks; Predictive models; Proteins; Software algorithms; Grid computing; Web services; neural networks; protein structure prediction; Algorithms; Artificial Intelligence; Computer Simulation; Internet; Models, Chemical; Models, Molecular; Protein Structure, Secondary; Proteins; Sequence Analysis, Protein; Software; User-Computer Interface;
fLanguage :
English
Journal_Title :
NanoBioscience, IEEE Transactions on
Publisher :
ieee
ISSN :
1536-1241
Type :
jour
DOI :
10.1109/TNB.2007.897475
Filename :
4220634
Link To Document :
بازگشت