• DocumentCode
    3574070
  • Title

    A fast algorithm for calculating minimum redundancy prefix codes with unsorted alphabet

  • Author

    Yupeng Tai ; Haibin Wang

  • Author_Institution
    State Key Lab. of Acoust., Inst. of Acoust., Beijing, China
  • fYear
    2014
  • Firstpage
    230
  • Lastpage
    235
  • Abstract
    Minimum redundancy coding (also known as Huffman code) is one of the most well-known algorithm of data compression. Many efforts have been made to improve the efficiency of it. Most of them are based on the assumption that the input alphabet has been already sorted. In this paper, we propose an algorithm of calculating the minimum-redundancy codes directly with unsorted alphabet. It consumes only O(nlog(n/k)) time in the worst cases, where n is the alphabet size and k is the longest codeword length. It is fast because only a part of the symbols requires to be sorted before the final minimum redundancy code is generated. The theoretical analysis and numerical simulation results show that this algorithm achieves a substantial improvement upon the best previous O(nlogn) algorithms for this problem.
  • Keywords
    Huffman codes; data compression; redundancy; Huffman code; codeword length; data compression; minimum redundancy prefix code calculation; unsorted alphabet; Acoustics; Algorithm design and analysis; Buildings; Huffman coding; Redundancy; Sorting; Huffman coding; Lossless data compression; Minimum redundancy code; Prefix-free code;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications and Networking in China (CHINACOM), 2014 9th International Conference on
  • Type

    conf

  • DOI
    10.1109/CHINACOM.2014.7054291
  • Filename
    7054291