DocumentCode :
1866227
Title :
Conditional NML Universal Models
Author :
Rissanen, Jorma ; Roos, Teemu
Author_Institution :
Complex Systems Computation Group, Helsinki Institute for Information Technology, University of Helsinki and Helsinki University of Technology, Email: jorma.rissanen@mdl-research.org
fYear :
2007
fDate :
Jan. 29 2007-Feb. 2 2007
Firstpage :
337
Lastpage :
341
Abstract :
The NML (normalized maximum likelihood) universal model has certain minmax optimal properties but it has two shortcomings: the normalizing coefficient can be evaluated in a closed form only for special model classes, and it does not define a random process so that it cannot be used for prediction. We present a universal conditional NML model, which has minmax optimal properties similar to those of the regular NML model. However, unlike NML, the conditional NML model defines a random process which can be used for prediction. It also admits a recursive evaluation for data compression. The conditional normalizing coefficient is much easier to evaluate, for instance, for tree machines than the integral of the square root of the Fisher information in the NML model. For Bernoulli distributions, the conditional NML model gives a predictive probability, which behaves like the Krichevsky-Trofimov predictive probability, actually slightly better for extremely skewed strings. For some model classes, it agrees with the predictive probability found earlier by Takimoto and Warmuth, as the solution to a different more restrictive minmax problem. We also calculate the CNML models for the generalized Gaussian regression models, and in particular for the cases where the loss function is quadratic, and show that the CNML model achieves asymptotic optimality in terms of the mean ideal code length. Moreover, the quadratic loss, which represents fitting errors as noise rather than prediction errors, can be shown to be smaller than what can be achieved with the NML as well as with the so-called plug-in or the predictive MDL model.
Keywords :
Gaussian processes; maximum likelihood estimation; minimax techniques; probability; regression analysis; Bernoulli distributions; Gaussian regression model; conditional NML universal models; conditional normalizing coefficient; minmax optimal properties; normalized maximum likelihood universal model; predictive probability; Data compression; Density functional theory; Information technology; Maximum likelihood estimation; Minimax techniques; Parametric statistics; Predictive models; Probability distribution; Random processes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Theory and Applications Workshop, 2007
Conference_Location :
La Jolla, CA
Print_ISBN :
978-0-615-15314-8
Type :
conf
DOI :
10.1109/ITA.2007.4357600
Filename :
4357600
Link To Document :
بازگشت