• DocumentCode
    900373
  • Title

    Efficient Algorithms for the Inverse Sort Transform

  • Author

    Nong, Ge ; Zhang, Sen

  • Author_Institution
    Sun Yat-Sen Univ., Guangzhou
  • Volume
    56
  • Issue
    11
  • fYear
    2007
  • Firstpage
    1564
  • Lastpage
    1574
  • Abstract
    As an important variant of the Burrows-Wheeler Transform (BWT), the Sort Transform (ST) can speed up the transformation by sorting only a portion of the matrix. However, because the currently known inverse ST algorithms need to retrieve the complete k-order contexts and use hash tables, they are less efficient than the inverse BWT. In this paper, we propose three fast and memory-efficient inverse ST algorithms. The first algorithm uses two auxiliary vectors to replace the hash tables. The algorithm achieves O(kN) time and space complexities for a text of N characters under the context order k. The second uses two additional compact "alternate vectors" to further eliminate the need to restore all of the k-order contexts and achieve O(N) space complexity. Moreover, the third uses a "doubling technique" to further reduce the time complexity to O(N log2 k). The hallmark of these three algorithms is that they can invert the ST in a manner similar to inverting BWT in that they all make use of precalculated auxiliary mapping vectors and require no hash tables. These unifying algorithms can also better explain the connection between the BWT and the ST: Not only can their forward components be performed by the same algorithm framework, but their respective inverse components can also be efficiently conducted by the unifying algorithm framework proposed in the present work.
  • Keywords
    computational complexity; data compression; matrix algebra; sorting; transforms; Burrows-Wheeler transform; auxiliary vectors; data compression; doubling technique; memory-efficient inverse ST algorithms; sort transform; space complexity; time complexity; Algorithm design and analysis; Arithmetic; Clustering algorithms; DNA; Data compression; Electrocardiography; Helium; Image coding; Sequences; Sorting; Burrows-Wheeler transform; algorithm design; data compression.; inverse sort transform; limit-order contexts;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/TC.2007.70762
  • Filename
    4336303