Title :
A fast algorithm for calculating minimum redundancy prefix codes with unsorted alphabet
Author :
Yupeng Tai ; Haibin Wang
Author_Institution :
State Key Lab. of Acoust., Inst. of Acoust., Beijing, China
Abstract :
Minimum redundancy coding (also known as Huffman code) is one of the most well-known algorithm of data compression. Many efforts have been made to improve the efficiency of it. Most of them are based on the assumption that the input alphabet has been already sorted. In this paper, we propose an algorithm of calculating the minimum-redundancy codes directly with unsorted alphabet. It consumes only O(nlog(n/k)) time in the worst cases, where n is the alphabet size and k is the longest codeword length. It is fast because only a part of the symbols requires to be sorted before the final minimum redundancy code is generated. The theoretical analysis and numerical simulation results show that this algorithm achieves a substantial improvement upon the best previous O(nlogn) algorithms for this problem.
Keywords :
Huffman codes; data compression; redundancy; Huffman code; codeword length; data compression; minimum redundancy prefix code calculation; unsorted alphabet; Acoustics; Algorithm design and analysis; Buildings; Huffman coding; Redundancy; Sorting; Huffman coding; Lossless data compression; Minimum redundancy code; Prefix-free code;
Conference_Titel :
Communications and Networking in China (CHINACOM), 2014 9th International Conference on
DOI :
10.1109/CHINACOM.2014.7054291