Title :
Universal Divergence Estimation for Finite-Alphabet Sources
Author :
Cai, Haixiao ; Kulkarni, Sanjeev R. ; Verdú, Sergio
Author_Institution :
Dept. of Electr. Eng., Princeton Univ., NJ
Abstract :
This paper studies universal estimation of divergence from the realizations of two unknown finite-alphabet sources. Two algorithms that borrow techniques from data compression are presented. The first divergence estimator applies the Burrows-Wheeler block sorting transform to the concatenation of the two realizations; consistency of this estimator is shown for all finite-memory sources. The second divergence estimator is based on the Context Tree Weighting method; consistency is shown for all sources whose memory length does not exceed a known bound. Experimental results show that both algorithms perform similarly and outperform string-matching and plug-in methods
Keywords :
data compression; sorting; Burrows-Wheeler block sorting; context tree weighting method; data compression; finite-alphabet source; universal divergence estimation; Bioinformatics; Convergence; Data compression; Entropy; Genomics; Markov processes; Mutual information; Phylogeny; Sorting; Source coding; Block sorting; Burrows–Wheeler transform; Markov sources; context tree weighting method; divergence estimation; information divergence; universal methods;
Journal_Title :
Information Theory, IEEE Transactions on
DOI :
10.1109/TIT.2006.878182