DocumentCode :
1205473
Title :
The strong law of large numbers for sequential decisions under uncertainty
Author :
Algoet, Paul H.
Author_Institution :
Inf. Syst. Lab., Stanford Univ., CA, USA
Volume :
40
Issue :
3
fYear :
1994
fDate :
5/1/1994 12:00:00 AM
Firstpage :
609
Lastpage :
633
Abstract :
Combines optimization and ergodic theory to characterize the optimum long-run average performance that can be asymptotically attained by nonanticipating sequential decisions. Let {Xt} be a stationary ergodic process, and suppose an action bt must be selected in a space ℬ with knowledge of the t-past (X0, ···, Xt-1) at the beginning of every period t⩾0. Action bt will incur a loss l(bt, Xt) at the end of period t when the random variable Xt is revealed. The author proves under mild integrability conditions that the optimum strategy is to select actions that minimize the conditional expected loss given the currently available information at each step. The minimum long-run average loss per decision can be approached arbitrarily closely by strategies that are finite-order Markov, and under certain continuity conditions, it is equal to the minimum expected loss given the infinite past. If the loss l(b, x) is bounded and continuous and if the space ℬ is compact, then the minimum can be asymptotically attained, even if the distribution of the process {Xt} is unknown a priori and must be learned from experience
Keywords :
Markov processes; decision theory; information theory; minimisation; statistical mechanics; uncertainty handling; conditional expected loss; continuity conditions; ergodic theory; finite-order Markov strategy; mild integrability conditions; minimization; minimum long-run average loss per decision; nonanticipating sequential decisions; optimization; optimum long-run average performance; random variable; sequential decisions; stationary ergodic process; strong law of large numbers; uncertainty; Decision theory; Helium; Information systems; Information theory; Investments; Pattern classification; Predictive models; Random variables; Space stations; Uncertainty;
fLanguage :
English
Journal_Title :
Information Theory, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9448
Type :
jour
DOI :
10.1109/18.335876
Filename :
335876
Link To Document :
بازگشت