DocumentCode
122960
Title
A space-efficient solution to find the maximum overlap using a compressed suffix array
Author
Rachid, Maan Haj ; Malluhi, Qutaibah ; Abouelhoda, Mohamed
fYear
2014
fDate
17-20 Feb. 2014
Firstpage
329
Lastpage
333
Abstract
Compressed indices are important data structures in stringology. Compressed versions of many well-known data structures used in string matching problems, such as the suffix tree and the suffix array, have been studied and proposed. This paper takes advantage of a very recent compressed suffix array to build a space-economic solution for an important bioinformatics problem, namely the all-pairs suffix-prefix problem. The paper also presents a simple technique for parallelizing the solution. Our results show that the proposed solution consumes less than one fifth of the space required by other solutions based on standard data structures. In addition, our results demonstrate that good performance scalability can be achieved by employing the proposed parallel algorithm.
Keywords
bioinformatics; data structures; parallel algorithms; string matching; all-pair suffix prefix problem; bioinformatics problem; compressed indices; compressed suffix array; maximum overlap; parallel algorithm; performance scalability; space-economic solution; space-efficient solution; string matching problems; stringology; suffix tree; Algorithm design and analysis; Arrays; Assembly; Bioinformatics; Genomics; Indexes
fLanguage
English
Publisher
IEEE
Conference_Titel
Biomedical Engineering (MECBME), 2014 Middle East Conference on
Conference_Location
Doha
Type
conf
DOI
10.1109/MECBME.2014.6783270
Filename
6783270