Title :
Performance of universal codes over infinite alphabets
Author :
Orlitsky, Alon ; Santhanam, Narayana P.
Author_Institution :
Univ. of California, USA
Abstract :
It was known that universal compression of strings generated by independent and identically distributed sources over infinite alphabets entails infinite per-symbol redundancy. Alternative compression schemes, which decompose the description of such strings into a description of the symbols appearing in the string, and a description of the arrangement of the symbols form were presented. Two descriptions of the symbol arrangement were considered: shapes and patterns. Roughly speaking, shapes describe the relative magnitude of the symbols while patterns describe only the order in which they appear. The per-symbol worst-case redundancy of compressing shapes is a positive constant less than one, and the per-symbol redundancy of compressing patterns diminishes to zero as the block-length increases were proven. Some results on sequential pattern compression were also mentioned.
Keywords :
data compression; redundancy; source coding; string matching; block-length; distributed sources; i.i.d. sources; independent sources; infinite alphabet; infinite per-symbol redundancy; patterns; performance evaluation; relative magnitude; sequential pattern compression; shapes; string compression; symbol arrangement; universal code; universal compression; worst-case redundancy; Data compression; Equations; Facsimile; Image coding; Probability distribution; Shape;
Conference_Titel :
Data Compression Conference, 2003. Proceedings. DCC 2003
Print_ISBN :
0-7695-1896-6
DOI :
10.1109/DCC.2003.1194031