DocumentCode
900373
Title
Efficient Algorithms for the Inverse Sort Transform
Author
Nong, Ge ; Zhang, Sen
Author_Institution
Sun Yat-Sen Univ., Guangzhou
Volume
56
Issue
11
fYear
2007
Firstpage
1564
Lastpage
1574
Abstract
As an important variant of the Burrows-Wheeler Transform (BWT), the Sort Transform (ST) can speed up the transformation by sorting only a portion of the matrix. However, because the currently known inverse ST algorithms need to retrieve the complete k-order contexts and use hash tables, they are less efficient than the inverse BWT. In this paper, we propose three fast and memory-efficient inverse ST algorithms. The first algorithm uses two auxiliary vectors to replace the hash tables. The algorithm achieves O(kN) time and space complexities for a text of N characters under the context order k. The second uses two additional compact "alternate vectors" to further eliminate the need to restore all of the k-order contexts and achieve O(N) space complexity. Moreover, the third uses a "doubling technique" to further reduce the time complexity to O(N log2 k). The hallmark of these three algorithms is that they can invert the ST in a manner similar to inverting BWT in that they all make use of precalculated auxiliary mapping vectors and require no hash tables. These unifying algorithms can also better explain the connection between the BWT and the ST: Not only can their forward components be performed by the same algorithm framework, but their respective inverse components can also be efficiently conducted by the unifying algorithm framework proposed in the present work.
Keywords
computational complexity; data compression; matrix algebra; sorting; transforms; Burrows-Wheeler transform; auxiliary vectors; data compression; doubling technique; memory-efficient inverse ST algorithms; sort transform; space complexity; time complexity; Algorithm design and analysis; Arithmetic; Clustering algorithms; DNA; Data compression; Electrocardiography; Helium; Image coding; Sequences; Sorting; Burrows-Wheeler transform; algorithm design; data compression.; inverse sort transform; limit-order contexts;
fLanguage
English
Journal_Title
Computers, IEEE Transactions on
Publisher
ieee
ISSN
0018-9340
Type
jour
DOI
10.1109/TC.2007.70762
Filename
4336303
Link To Document