Title :
FROST: Revisited and Distributed
Author :
Poirriez, Vincent ; Marin, Antoine ; Andonov, Rumen ; Gibrat, Jean-François
Author_Institution :
Univ. de Valenciennes, France
Abstract :
FROST (Fold Recognition-Oriented Search Tool) is a software tool that assigns a 3D structure to a protein sequence. It is based on a series of filters and uses a database of about 1200 known 3D structures, each associated with empirically determined score distributions. FROST uses these distributions to normalize the score obtained when a protein sequence is aligned with a particular 3D structure. Computing these distributions is extremely time-consuming: it requires solving about 1,200,000 hard combinatorial optimization problems and takes about 40 days on a 2.4 GHz computer. This paper describes how FROST has been successfully redesigned and structured into modules and independent tasks. The new package organization allows these tasks to be distributed and executed in parallel using a centralized dynamic load balancing strategy. On a cluster of 12 PCs, computing the score distributions now takes about 3 days, which represents a parallelization efficiency of about 1.
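The abstract does not give implementation details, so the following is only a minimal sketch of what a centralized dynamic load balancing scheme can look like: independent tasks sit in one shared queue, and each worker pulls the next task as soon as it becomes idle. Threads stand in for cluster nodes here, and the names `run_centralized_dynamic`, `work_fn`, and `parallel_efficiency` are hypothetical, not taken from FROST. The second function applies the standard definition of efficiency (speedup divided by processor count) to the rounded figures quoted in the abstract.

```python
import queue
import threading


def run_centralized_dynamic(tasks, n_workers, work_fn):
    """Run independent tasks from one shared queue (illustrative sketch).

    An idle worker pulls the next pending task, so fast workers
    naturally end up processing more tasks than slow ones.
    """
    tasks = list(tasks)
    pending = queue.Queue()
    for index, task in enumerate(tasks):
        pending.put((index, task))

    results = [None] * len(tasks)   # one slot per task, written once
    per_worker = [0] * n_workers    # how many tasks each worker handled
    lock = threading.Lock()

    def worker(worker_id):
        while True:
            try:
                index, task = pending.get_nowait()
            except queue.Empty:
                return              # no work left: this worker stops
            value = work_fn(task)
            with lock:
                results[index] = value
                per_worker[worker_id] += 1

    threads = [threading.Thread(target=worker, args=(i,))
               for i in range(n_workers)]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    return results, per_worker


def parallel_efficiency(serial_time, parallel_time, n_processors):
    """Efficiency = speedup / number of processors."""
    speedup = serial_time / parallel_time
    return speedup / n_processors


if __name__ == "__main__":
    # Toy workload standing in for the alignment problems (assumption).
    results, per_worker = run_centralized_dynamic(
        range(100), 4, lambda x: x * x)
    print(sum(per_worker), "tasks done")
    # Rounded figures from the abstract: 40 days serial, 3 days on 12 PCs.
    print(round(parallel_efficiency(40, 3, 12), 2))
```

With the rounded figures from the abstract, the speedup is roughly 40 / 3 ≈ 13.3 on 12 machines, which is consistent with the stated efficiency of about 1.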
Keywords :
biology computing; combinatorial mathematics; multi-threading; optimisation; proteins; resource allocation; search problems; software packages; 3D protein sequence structure assignment; FROST software; Fold Recognition-Oriented Search Tool; combinatorial optimization problems; dynamic load balancing strategy; parallel algorithm; protein threading; Amino acids; Computational biology; Databases; Distributed computing; Filters; Load management; Packaging; Personal communication networks; Protein sequence; Software tools; large scale problems; parallel algorithms
Conference_Title :
Parallel and Distributed Processing Symposium, 2005. Proceedings. 19th IEEE International
Print_ISBN :
0-7695-2312-9
DOI :
10.1109/IPDPS.2005.231