Title :
Adapting the Knuth-Morris-Pratt algorithm for pattern matching in Huffman encoded texts
Author :
Daptardar, Ajay ; Shapira, Dana
Author_Institution :
Dept. of Comput. Sci., Brandeis Univ., Waltham, MA, USA
Abstract :
This paper presents a compressed pattern matching in Huffman encoded texts. A modified Knuth-Morris-Pratt (KMP) algorithm is used in order to overcome the problem of false matches. This paper also proposes a bitwise KMP algorithm that can move one extra bit in the case of a mismatch, since the alphabet is binary. The KMP algorithm is combined with two Huffman decoding algorithms called sk-kmp and win-kmp to handle more than a single bit per machine operation. However, skeleton trees are used for efficient decoding of Huffman encoded texts.
Keywords :
Huffman codes; binary codes; data compression; pattern matching; tree codes; Huffman decoding algorithm; Huffman encoded text; Knuth-Morris-Pratt algorithm; binary alphabet; bitwise KMP algorithm; false match; pattern matching compression; single bit per machine operation; skeleton tree; Australia; Computer science; Data compression; Decoding; Humans; Information retrieval; Pattern matching; Skeleton;
Conference_Titel :
Data Compression Conference, 2004. Proceedings. DCC 2004
Print_ISBN :
0-7695-2082-0
DOI :
10.1109/DCC.2004.1281511