Title : 
The strong law of large numbers for sequential decisions under uncertainty
         
        
        
            Author_Institution : 
Inf. Syst. Lab., Stanford Univ., CA, USA
         
        
        
        
        
            fDate : 
5/1/1994 12:00:00 AM
         
        
        
        
            Abstract : 
Combines optimization and ergodic theory to characterize the optimum long-run average performance that can be asymptotically attained by nonanticipating sequential decisions. Let {Xt} be a stationary ergodic process, and suppose an action bt must be selected in a space ℬ with knowledge of the t-past (X0, ···, Xt-1) at the beginning of every period t⩾0. Action bt will incur a loss l(bt, Xt) at the end of period t when the random variable Xt  is revealed. The author proves under mild integrability conditions that the optimum strategy is to select actions that minimize the conditional expected loss given the currently available information at each step. The minimum long-run average loss per decision can be approached arbitrarily closely by strategies that are finite-order Markov, and under certain continuity conditions, it is equal to the minimum expected loss given the infinite past. If the loss l(b, x) is bounded and continuous and if the space ℬ is compact, then the minimum can be asymptotically attained, even if the distribution of the process {Xt} is unknown a priori and must be learned from experience
         
        
            Keywords : 
Markov processes; decision theory; information theory; minimisation; statistical mechanics; uncertainty handling; conditional expected loss; continuity conditions; ergodic theory; finite-order Markov strategy; mild integrability conditions; minimization; minimum long-run average loss per decision; nonanticipating sequential decisions; optimization; optimum long-run average performance; random variable; sequential decisions; stationary ergodic process; strong law of large numbers; uncertainty; Decision theory; Helium; Information systems; Information theory; Investments; Pattern classification; Predictive models; Random variables; Space stations; Uncertainty;
         
        
        
            Journal_Title : 
Information Theory, IEEE Transactions on