DocumentCode
2719467
Title
Parallel pattern identification in biological sequences on clusters
Author
Huang, Chun-Hsi ; Biswas, Ratnabali
Author_Institution
Dept. of Comput. Sci. & Eng., Connecticut Univ., Storrs, CT, USA
fYear
2002
fDate
2002
Firstpage
127
Lastpage
134
Abstract
This paper presents a low communication overhead parallel algorithm for pattern matching in biological sequences. Given such a sequence of length n and a pattern of length m, we derive an algorithm with five computation/communication phases, each requiring O(n) computation time and only O(p) message units. The low communication overhead of the algorithm is essential to achieving reasonable speedups on clusters, where the interprocessor communication latency is usually higher. Previous parallel implementations use straightforward domain decomposition based on existing sequential algorithms and rely on parallel machines with low-latency interconnection networks and fast hardware support for processor synchronization.
Keywords
biology computing; communication complexity; parallel algorithms; sequences; string matching; workstation clusters; biological sequences; clusters; computation time; computation/communication phases; interprocessor communication latency; low communication overhead parallel algorithm; message units; parallel pattern identification; pattern matching; speedups; Biology computing; Clustering algorithms; DNA; Delay; Hardware; Multiprocessor interconnection networks; Parallel algorithms; Pattern matching; Proteins; Sequences;
fLanguage
English
Publisher
IEEE
Conference_Titel
Cluster Computing, 2002. Proceedings. 2002 IEEE International Conference on
Print_ISBN
0-7695-2066-9
Type
conf
DOI
10.1109/CLUSTR.2002.1137737
Filename
1137737