Title :
A SIMD solution to the sequence comparison problem on the MGAP
Author :
Borah, Manjit ; Bajwa, Raminder S. ; Hannenhalli, Sridhar ; Irwin, Mary Jane
Author_Institution :
Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA
Abstract :
Molecular biologists frequently compare an unknown biosequence with a set of other known biosequences to find the sequence which is maximally similar, with the hope that what is true of one sequence, either physically or functionally, could be true of its analogue. Even though efficient dynamic programming algorithms exist for the problem, when the size of the database is large, the time required is quite long, even for moderate length sequences. In this paper, we present an efficient pipelined SIMD solution to the sequence alignment problem on the Micro-Grain Array Processor (MGAP), a fine-grained massively parallel array of processors with nearest-neighbor connections. The algorithm compares K sequences of length O(M) with the actual sequence of length N, in O(M+N+K) time with O(MN) processors, which is AT-optimal. The implementation on the MGAP computes at the rate of about 0.1 million comparisons per second for sequences of length 128
Keywords :
biology computing; computational complexity; dynamic programming; molecular biophysics; parallel algorithms; pipeline processing; AT-optimal algorithm; MGAP parallel computer; Micro-Grain Array Processor; biosequence comparison problem; database size; dynamic programming algorithms; fine-grained massively parallel processor array; maximally similar sequence; molecular biology; nearest-neighbor connections; pipelined SIMD solution; sequence alignment problem; Biological information theory; Computer science; Costs; DNA computing; Databases; Genetics; Heuristic algorithms; Military computing; Nearest neighbor searches; Sequences;
Conference_Titel :
Application Specific Array Processors, 1994. Proceedings. International Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
0-8186-6517-3
DOI :
10.1109/ASAP.1994.331791