DocumentCode :
1764518
Title :
Minimax Estimation of Functionals of Discrete Distributions
Author :
Jiantao Jiao ; Kartik Venkat ; Yanjun Han ; Tsachy Weissman
Author_Institution :
Dept. of Electr. Eng., Stanford Univ., Stanford, CA, USA
Volume :
61
Issue :
5
fYear :
2015
fDate :
1 May 2015
Firstpage :
2835
Lastpage :
2885
Abstract :
We propose a general methodology for the construction and analysis of essentially minimax estimators for a wide class of functionals of finite dimensional parameters, and elaborate on the case of discrete distributions, where the support size S is unknown and may be comparable with or even much larger than the number of observations n. We treat the respective regions where the functional is nonsmooth and smooth separately. In the nonsmooth regime, we apply an unbiased estimator for the best polynomial approximation of the functional, whereas in the smooth regime, we apply a bias-corrected version of the maximum likelihood estimator (MLE). We illustrate the merit of this approach by thoroughly analyzing the performance of the resulting schemes for estimating two important information measures: 1) the entropy H(P) = Σ_{i=1}^{S} −p_i ln p_i and 2) F_α(P) = Σ_{i=1}^{S} p_i^α, α > 0. We obtain the minimax L_2 rates for estimating these functionals. In particular, we demonstrate that our estimator achieves the optimal sample complexity n ≍ S/ln S for entropy estimation. We also demonstrate that the sample complexity for estimating F_α(P), 0 < α < 1, is n ≍ S^{1/α}/ln S, which can be achieved by our estimator but not the MLE. For 1 < α < 3/2, we show the minimax L_2 rate for estimating F_α(P) is (n ln n)^{−2(α−1)} for infinite support size, while the L_2 rate for the MLE is n^{−2(α−1)}. For all the above cases, the behavior of the minimax rate-optimal estimators with n samples is essentially that of the MLE (plug-in rule) with n ln n samples, which we term “effective sample size enlargement.” We highlight the practical advantages of our schemes for the estimation of entropy and mutual information. We compare our performance with various existing approaches, and demonstrate that our approach reduces running time and boosts the accuracy. Moreover, we show that the minimax rate-optimal mutual information estimator yielded by our framework leads to significant performance boosts over the Chow-Liu algorithm in learning graphical models. The wide use of information measure estimation suggests that the insights and estimators obtained in this paper could be broadly applicable.
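To make the two-regime construction described above concrete, the following minimal Python sketch is written from the abstract alone: the constants c1 and c2, the degree choice of order ln n, the least-squares polynomial fit standing in for the paper's best uniform polynomial approximation, and the treatment of unseen symbols (ignored here) are all illustrative assumptions, not the authors' exact estimator.

# Illustrative sketch of a two-regime entropy estimator, following the
# abstract's description. Assumptions (not from the paper): constants c1, c2,
# degree ~ ln n, a least-squares fit instead of the best uniform polynomial
# approximation, and unseen symbols ignored.
import numpy as np

def entropy_estimate(counts, c1=0.5, c2=1.0):
    counts = np.asarray(counts, dtype=np.int64)
    n = int(counts.sum())
    threshold = c2 * np.log(n)        # small counts -> nonsmooth regime
    delta = c1 * np.log(n) / n        # approximation interval [0, delta]
    degree = max(2, int(np.log(n)))   # polynomial degree grows like ln n

    # Fit a polynomial to f(p) = -p ln p on [0, delta]; fit on [0, 1] and
    # rescale the coefficients, which is better conditioned numerically.
    t = np.linspace(1e-12, 1.0, 1000)
    b = np.polynomial.polynomial.polyfit(t, -(delta * t) * np.log(delta * t), degree)
    coeffs = b / delta ** np.arange(degree + 1)   # coefficients in powers of p

    H = 0.0
    for x in counts:
        if x == 0:
            continue  # unseen symbols are ignored in this sketch
        if x < threshold:
            # Nonsmooth regime: each monomial p^j admits the unbiased estimate
            # X(X-1)...(X-j+1) / (n(n-1)...(n-j+1)) for X ~ Binomial(n, p),
            # so the fitted polynomial is estimated term by term.
            est = coeffs[0]
            for j in range(1, degree + 1):
                num = np.prod(np.arange(x, x - j, -1, dtype=float))
                den = np.prod(np.arange(n, n - j, -1, dtype=float))
                est += coeffs[j] * num / den
            H += est
        else:
            # Smooth regime: plug-in -p ln p with a first-order bias correction,
            # since E[-p_hat ln p_hat] = -p ln p - (1-p)/(2n) + o(1/n).
            p_hat = x / n
            H += -p_hat * np.log(p_hat) + (1 - p_hat) / (2 * n)
    return H

# Example in the regime the abstract targets: alphabet size S = 2000 much
# larger than the sample size n = 500.
rng = np.random.default_rng(0)
p = rng.dirichlet(np.ones(2000))
counts = rng.multinomial(500, p)
print(entropy_estimate(counts))        # two-regime estimate
print(float(-(p * np.log(p)).sum()))   # true entropy, for comparison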
Keywords :
entropy; functional analysis; maximum likelihood estimation; minimax techniques; polynomial approximation; Chow–Liu algorithm; MLE; bias-corrected version; discrete distribution; effective sample size enlargement; entropy estimation; finite dimensional parameter functional; graphical model; maximum likelihood estimator; minimax rate-optimal estimation; mutual information; nonsmooth regime; Approximation methods; Complexity theory; Entropy; Maximum likelihood estimation; Mutual information; Polynomials; Rényi entropy; Mean squared error; approximation theory; minimax lower bound; high dimensional statistics; minimax-optimality; nonsmooth functional estimation;
fLanguage :
English
Journal_Title :
Information Theory, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9448
Type :
jour
DOI :
10.1109/TIT.2015.2412945
Filename :
7060676