Parallelized protein secondary structure prediction

Author

Qi, Yu-Tao ; Lin, Feng

Author_Institution

Bioinformatics Res. Center, Nanyang Technol. Univ., Singapore

Volume

4

fYear

2004

fDate

26-29 Aug. 2004

Firstpage

2074

Abstract

Functional characterization of a protein sequence is one of the most frequent problems in biology. Secondary structure prediction is a useful first step in understanding how the amino acid sequence of a protein determines the native state. There are many previous famous prediction methods available now, although most of them allow users to submit their queries to dedicated servers, bulk protein sequence prediction service such as the whole database sequence prediction is still not widely available or requires tedious waiting tune. The author proposed a parallelized protein secondary structure prediction method namely mpiPSSP, using message passing interface (MPI) on a multi-node compute cluster. A preprocess mpiSEQ processes and converts the fasta-like database to the format preferred by mpiPSSP, and a post process mpiRES collects and reports the result as well. mpiPSSP can self-determine the appropriate number of processors according to the size of the querying database or also run using the user-desired number of processors. As the tasks are made independent of each other, a highly scalable solution is achieved.

Keywords

biology computing; proteins; workstation clusters; amino acid sequence; functional characterization; message passing interface; multinode compute cluster; parallelized protein secondary structure prediction; protein sequence; Bioinformatics; Biology computing; Computer interfaces; Concurrent computing; Databases; Message passing; PROM; Prediction methods; Protein engineering; Protein sequence;

fLanguage

English

Publisher

ieee

Conference_Titel

Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on

Print_ISBN

0-7803-8403-2

Type

conf

DOI

10.1109/ICMLC.2004.1382137

Filename

1382137