Title :
Microparallelism and High-Performance Protein Matching
Author :
Alpern, Bowen ; Carter, Larry ; Gatlin, Kang Su
Abstract :
The Smith-Waterman algorithm is a computationally-intensive string-matching operation that is fundamental to the analysis of proteins and genes. In this paper, we explore the use of some standard and novel techniques for improving its performance. We begin by tuning the algorithm using conventional techniques. These make modest performance improvements by providing efficient cache usage and inner-loop code. One novel technique uses the z-buffer operations of the Intel i860 architecture to perform 4 independent computations in parallel. This achieves a five-fold speedup over the optimized code (six-fold over the original). We also describe a related technique that could be used by processors that have 64-bit integer operations, but no z-buffer. Another new technique uses floating-point multiplies and adds in place of the standard algorithm´s integer additions and maximum operations. This gains more than a three-fold speedup on the IBM POWER2 processor. This method doesn´t give the identical answers as the original program, but experimental evidence shows that the inaccuracies are small and do not affect which strings are chosen as good matches by the algorithm.
Keywords :
DNA; dynamic programming; high-performance scientific computing; molecular biology; parallel computers; program optimization techniques; protein sequence analysis; string matching; z-buffer; Algorithm design and analysis; Amino acids; Biology computing; Computer architecture; Concurrent computing; DNA computing; Databases; Dynamic programming; Protein sequence; Scientific computing; DNA; dynamic programming; high-performance scientific computing; molecular biology; parallel computers; program optimization techniques; protein sequence analysis; string matching; z-buffer;
Conference_Titel :
Supercomputing, 1995. Proceedings of the IEEE/ACM SC95 Conference
Print_ISBN :
0-89791-816-9
DOI :
10.1109/SUPERC.1995.242795