DocumentCode :
1633947
Title :
Noisy Bayesian active learning
Author :
Naghshvar, Mohammad ; Javidi, Tara ; Chaudhuri, Kamalika
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of California San Diego, La Jolla, CA, USA
fYear :
2012
Firstpage :
1626
Lastpage :
1633
Abstract :
Consider the problem of noisy Bayesian active learning given a sample space, a finite label set, and a finite set of label generating functions from the sample space to the label set, also known as the function class. The objective is to identify the function in the function class that generates the labels using as few label queries as possible and with low probability of error despite possible corruption by independent noise. The key to achieving this objective relies on the selection of queries in a strategic and adaptive manner. The problem generalizes the problem of noisy generalized binary search [1]. This paper explores the connection between the above Bayesian active learning problem and the problem of information acquisition in which a Bayesian decision maker is responsible for dynamically collecting noisy observations so as to enhance her/his information in a speedy manner about an underlying phenomena of interest with a constraint on the probability of error. This view of the problem allows for developing a general lower bound on the expected number of queries to identify the function that generates the labels in terms of the observation noise statistics, the desired probability of error, and the cardinality of the function class. Furthermore, viewing the problem as an information acquisition problem enables a deterministic and Markov heuristic policy based on greedy maximization of Extrinsic Jensen-Shannon divergence [2]. The performance of this heuristic is compared with the state of the art strategies for noisy generalized binary search. In the case where the function class is sample-rich, it is shown that this heuristic is better than previous results and, in particular, matches the earlier proposed lower bound asymptotically.
Keywords :
Bayes methods; Markov processes; decision making; greedy algorithms; learning (artificial intelligence); optimisation; probability; query processing; search problems; Bayesian decision maker; deterministic Markov heuristic policy; error probability; extrinsic Jensen-Shannon divergence; finite label set; function class cardinality; greedy maximization; information acquisition; information acquisition problem; label generating functions; label query selection; noisy Bayesian active learning; noisy generalized binary search; observation noise statistics; Bayes methods; Markov processes; Noise; Noise measurement; Search problems; Upper bound; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communication, Control, and Computing (Allerton), 2012 50th Annual Allerton Conference on
Conference_Location :
Monticello, IL
Print_ISBN :
978-1-4673-4537-8
Type :
conf
DOI :
10.1109/Allerton.2012.6483415
Filename :
6483415
Link To Document :
بازگشت