• DocumentCode
    122960
  • Title

    A space-efficient solution to find the maximum overlap using a compressed suffix array

  • Author

    Rachid, Maan Haj ; Malluhi, Qutaibah ; Abouelhoda, Mohamed

  • fYear
    2014
  • fDate
    17-20 Feb. 2014
  • Firstpage
    329
  • Lastpage
    333
  • Abstract
    Compressed indices are important data structures in stringology. Compressed versions of many well-known data structures such as suffix tree and suffix array, which are used in string matching problems, have been studied and proposed. This paper takes advantage of a very recent compressed suffix array to build a space-economic solution for an important bioinformatics problem, namely the all-pairs suffix prefix problem. The paper also presents a simple technique for parallelizing the solution. Our results show that the proposed solution consumes less than one fifth of the space required by other solutions based on standard data structures. In addition, our results demonstrate that good performance scalability can be achieved by employing the proposed parallel algorithm.
  • Keywords
    bioinformatics; data structures; parallel algorithms; string matching; all-pair suffix prefix problem; bioinformatics problem; compressed indices; compressed suffix array; data structures; maximum overlap; parallel algorithm; performance scalability; space-economic solution; space-efficient solution; string matching problems; stringology; suffix tree; Algorithm design and analysis; Arrays; Assembly; Bioinformatics; Genomics; Indexes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Biomedical Engineering (MECBME), 2014 Middle East Conference on
  • Conference_Location
    Doha
  • Type

    conf

  • DOI
    10.1109/MECBME.2014.6783270
  • Filename
    6783270