Title :
Linear Suffix Array Construction by Almost Pure Induced-Sorting
Author :
Nong, Ge ; Zhang, Sen ; Chan, Wai Hong
Author_Institution :
Comput. Sci. Dept., Sun Yat-Sen Univ., Guangzhou
Abstract :
We present a linear time and space suffix array (SA) construction algorithm called the SA-IS algorithm.The SA-IS algorithm is novel because of the LMS-substrings used for the problem reduction and the pure induced-sorting (specially coined for this algorithm)used to propagate the order of suffixes as well as that of LMS-substrings, which makes the algorithm almost purely relying on induced sorting at both its crucial steps.The pure induced-sorting renders the algorithm an elegant design and in turn a surprisingly compact implementation which consists of less than 100 lines of C code.The experimental results demonstrate that this newly proposed algorithm yields noticeably better time and space efficiencies than all the currently published linear time algorithms for SA construction.
Keywords :
data handling; data structures; C code; LMS-substrings; SA-IS algorithm; linear suffix array construction algorithm; linear time algorithms; problem reduction; pure induced sorting; Algorithm design and analysis; Computer science; Data compression; Data structures; Educational institutions; Indexing; Information retrieval; Mathematics; Sorting; Sun; Suffix array; algorithm design; divide-and-conquer; linear time;
Conference_Titel :
Data Compression Conference, 2009. DCC '09.
Conference_Location :
Snowbird, UT
Print_ISBN :
978-1-4244-3753-5
DOI :
10.1109/DCC.2009.42