Title :
Optimal suffix tree construction with large alphabets
Author_Institution :
Dept. of Comput. Sci., Rutgers Univ., Piscataway, NJ, USA
Abstract :
The suffix tree of a string is the fundamental data structure of combinatorial pattern matching. Weiner (1973), who introduced the data structure, gave an O(n)-time algorithm for building the suffix tree of an n-character string drawn from a constant size alphabet. In the comparison model, there is a trivial Ω(n log n)-time lower bound based on sorting, and Weiner´s algorithm matches this bound trivially. For integer alphabets, a substantial gap remains between the known upper and lower bounds, and closing this gap is the main open question in the construction of suffix trees. There is no super-linear lower bound, and the fastest known algorithm was the O(n log n) time comparison based algorithm. We settle this open problem by closing the gap: we build suffix trees in linear time for integer alphabet
Keywords :
pattern matching; tree data structures; combinatorial pattern matching; data structure; integer alphabet; integer alphabets; large alphabets; sorting; suffix tree; Buildings; Computer science; Data structures; Engineering profession; Labeling; Pattern matching; Sorting; Tree data structures; Upper bound; World Wide Web;
Conference_Titel :
Foundations of Computer Science, 1997. Proceedings., 38th Annual Symposium on
Conference_Location :
Miami Beach, FL
Print_ISBN :
0-8186-8197-7
DOI :
10.1109/SFCS.1997.646102