DocumentCode :
122960
Title :
A space-efficient solution to find the maximum overlap using a compressed suffix array
Author :
Rachid, Maan Haj ; Malluhi, Qutaibah ; Abouelhoda, Mohamed
fYear :
2014
fDate :
17-20 Feb. 2014
Firstpage :
329
Lastpage :
333
Abstract :
Compressed indices are important data structures in stringology. Compressed versions of many well-known data structures such as suffix tree and suffix array, which are used in string matching problems, have been studied and proposed. This paper takes advantage of a very recent compressed suffix array to build a space-economic solution for an important bioinformatics problem, namely the all-pairs suffix prefix problem. The paper also presents a simple technique for parallelizing the solution. Our results show that the proposed solution consumes less than one fifth of the space required by other solutions based on standard data structures. In addition, our results demonstrate that good performance scalability can be achieved by employing the proposed parallel algorithm.
Keywords :
bioinformatics; data structures; parallel algorithms; string matching; all-pair suffix prefix problem; bioinformatics problem; compressed indices; compressed suffix array; data structures; maximum overlap; parallel algorithm; performance scalability; space-economic solution; space-efficient solution; string matching problems; stringology; suffix tree; Algorithm design and analysis; Arrays; Assembly; Bioinformatics; Genomics; Indexes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Biomedical Engineering (MECBME), 2014 Middle East Conference on
Conference_Location :
Doha
Type :
conf
DOI :
10.1109/MECBME.2014.6783270
Filename :
6783270
Link To Document :
بازگشت