Title :
Optimal and asymptotically optimal decision rules for sequential screening and resource allocation
Author_Institution :
Lab. I3S, Univ. de Nice-Sophia Antipolis, Valbonne, France
fDate :
5/1/2001 12:00:00 AM
Abstract :
We consider the problem of maximizing the expected sum of n variables Xk chosen sequentially in an i.i.d. sequence of length N. It is equivalent to the following resource allocation problem: n machines have to be allocated to N jobs of value Xk (k=1,…,N) arriving sequentially, the ith machine has a (known) probability pi to perform the job successfully and the total expected reward must be maximized. The optimal solution of this stochastic dynamic-programming problem is derived when the distribution of the Xks is known: the optimal threshold for accepting X k, or allocating the ith machine to job k, is given by a backward recurrence equation. This optimal solution is compared to the simpler (but suboptimal) open-loop feedback-optimal solution for which the threshold is constant, and their asymptotic behaviors are investigated. The asymptotic behavior of the optimal threshold is used to derive a simple open-loop solution, which is proved to be asymptotically optimal (N→∞ with n fixed) for a large class of distributions for Xk
Keywords :
decision theory; design of experiments; dynamic programming; parameter estimation; probability; resource allocation; stochastic programming; asymptotic behaviors; asymptotically optimal decision rules; backward recurrence equation; open-loop feedback-optimal solution; optimal threshold; sequential screening; stochastic dynamic-programming problem; total expected reward; Character generation; Delay; Design for experiments; Detectors; Difference equations; Event detection; Nuclear physics; Random variables; Resource management; Stochastic processes;
Journal_Title :
Automatic Control, IEEE Transactions on