DocumentCode
2743389
Title
Adapting the Knuth-Morris-Pratt algorithm for pattern matching in Huffman encoded texts
Author
Daptardar, Ajay ; Shapira, Dana
Author_Institution
Dept. of Comput. Sci., Brandeis Univ., Waltham, MA, USA
fYear
2004
fDate
23-25 March 2004
Firstpage
535
Abstract
This paper presents a compressed pattern matching in Huffman encoded texts. A modified Knuth-Morris-Pratt (KMP) algorithm is used in order to overcome the problem of false matches. This paper also proposes a bitwise KMP algorithm that can move one extra bit in the case of a mismatch, since the alphabet is binary. The KMP algorithm is combined with two Huffman decoding algorithms called sk-kmp and win-kmp to handle more than a single bit per machine operation. However, skeleton trees are used for efficient decoding of Huffman encoded texts.
Keywords
Huffman codes; binary codes; data compression; pattern matching; tree codes; Huffman decoding algorithm; Huffman encoded text; Knuth-Morris-Pratt algorithm; binary alphabet; bitwise KMP algorithm; false match; pattern matching compression; single bit per machine operation; skeleton tree; Australia; Computer science; Data compression; Decoding; Humans; Information retrieval; Pattern matching; Skeleton;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Compression Conference, 2004. Proceedings. DCC 2004
ISSN
1068-0314
Print_ISBN
0-7695-2082-0
Type
conf
DOI
10.1109/DCC.2004.1281511
Filename
1281511
Link To Document