• DocumentCode
    424176
  • Title

    Parallelized protein secondary structure prediction

  • Author

    Qi, Yu-Tao ; Lin, Feng

  • Author_Institution
    Bioinformatics Res. Center, Nanyang Technol. Univ., Singapore
  • Volume
    4
  • fYear
    2004
  • fDate
    26-29 Aug. 2004
  • Firstpage
    2074
  • Abstract
    Functional characterization of a protein sequence is one of the most frequent problems in biology. Secondary structure prediction is a useful first step in understanding how the amino acid sequence of a protein determines the native state. There are many previous famous prediction methods available now, although most of them allow users to submit their queries to dedicated servers, bulk protein sequence prediction service such as the whole database sequence prediction is still not widely available or requires tedious waiting tune. The author proposed a parallelized protein secondary structure prediction method namely mpiPSSP, using message passing interface (MPI) on a multi-node compute cluster. A preprocess mpiSEQ processes and converts the fasta-like database to the format preferred by mpiPSSP, and a post process mpiRES collects and reports the result as well. mpiPSSP can self-determine the appropriate number of processors according to the size of the querying database or also run using the user-desired number of processors. As the tasks are made independent of each other, a highly scalable solution is achieved.
  • Keywords
    biology computing; proteins; workstation clusters; amino acid sequence; functional characterization; message passing interface; multinode compute cluster; parallelized protein secondary structure prediction; protein sequence; Bioinformatics; Biology computing; Computer interfaces; Concurrent computing; Databases; Message passing; PROM; Prediction methods; Protein engineering; Protein sequence;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
  • Print_ISBN
    0-7803-8403-2
  • Type

    conf

  • DOI
    10.1109/ICMLC.2004.1382137
  • Filename
    1382137