Title : 
Fast searching over compressed text using a new coding technique: tagged sub-optimal code (TSC)
         
        
            Author : 
Bellaachia, Abdelghani ; Rassan, I.A.L.
         
        
            Author_Institution : 
Washington Univ., USA
         
        
        
        
        
            Abstract : 
In this paper, a new coding technique called tagged sub-optimal code (TSC) is proposed. TSC is a variable-length sub-optimal code that supports minimal prefix property. TSC technique is beneficial in many types of applications: speeding up string matching over compressed text, speeding decoding process, robustness of error detection and recovery during transmission, as well as in general-purpose integer representation code. The experimental results show that TSC is 8.9 times faster than string matching over compressed text using Huffman encoding, and 3 times faster in the decoding process.
         
        
            Keywords : 
Huffman codes; data compression; decoding; error detection; error detection codes; string matching; text analysis; variable length codes; Huffman encoding; coding technique; compressed text; error detection; fast search; general-purpose integer representation code; minimal prefix property; recovery transmission; speeding decoding process; string matching; tagged sub-optimal code; variable-length sub-optimal code; Data compression; Data processing; Decoding; Delay; Encoding; Notice of Violation; Robustness; Table lookup;
         
        
        
        
            Conference_Titel : 
Data Compression Conference, 2004. Proceedings. DCC 2004
         
        
        
            Print_ISBN : 
0-7695-2082-0
         
        
        
            DOI : 
10.1109/DCC.2004.1281502