DocumentCode
424176
Title
Parallelized protein secondary structure prediction
Author
Qi, Yu-Tao ; Lin, Feng
Author_Institution
Bioinformatics Res. Center, Nanyang Technol. Univ., Singapore
Volume
4
fYear
2004
fDate
26-29 Aug. 2004
Firstpage
2074
Abstract
Functional characterization of a protein sequence is one of the most frequent problems in biology. Secondary structure prediction is a useful first step in understanding how the amino acid sequence of a protein determines the native state. There are many previous famous prediction methods available now, although most of them allow users to submit their queries to dedicated servers, bulk protein sequence prediction service such as the whole database sequence prediction is still not widely available or requires tedious waiting tune. The author proposed a parallelized protein secondary structure prediction method namely mpiPSSP, using message passing interface (MPI) on a multi-node compute cluster. A preprocess mpiSEQ processes and converts the fasta-like database to the format preferred by mpiPSSP, and a post process mpiRES collects and reports the result as well. mpiPSSP can self-determine the appropriate number of processors according to the size of the querying database or also run using the user-desired number of processors. As the tasks are made independent of each other, a highly scalable solution is achieved.
Keywords
biology computing; proteins; workstation clusters; amino acid sequence; functional characterization; message passing interface; multinode compute cluster; parallelized protein secondary structure prediction; protein sequence; Bioinformatics; Biology computing; Computer interfaces; Concurrent computing; Databases; Message passing; PROM; Prediction methods; Protein engineering; Protein sequence;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
Print_ISBN
0-7803-8403-2
Type
conf
DOI
10.1109/ICMLC.2004.1382137
Filename
1382137
Link To Document