Title :
A Rank-Based Approach to Active Diagnosis
Author :
Bellala, G. ; Stanley, Jon ; Bhavnani, S.K. ; Scott, Clayton
Author_Institution :
Hewlett Packard Labs., Palo Alto, CA, USA
Abstract :
The problem of active diagnosis arises in several applications such as disease diagnosis and fault diagnosis in computer networks, where the goal is to rapidly identify the binary states of a set of objects (e.g., faulty or working) by sequentially selecting, and observing, potentially noisy responses to binary valued queries. Previous work in this area chooses queries sequentially based on Information gain, and the object states are inferred by maximum a posteriori (MAP) estimation. In this work, rather than MAP estimation, we aim to rank objects according to their posterior fault probability. We propose a greedy algorithm to choose queries sequentially by maximizing the area under the ROC curve associated with the ranked list. The proposed algorithm overcomes limitations of existing work. When multiple faults may be present, the proposed algorithm does not rely on belief propagation, making it feasible for large scale networks with little loss in performance. When a single fault is present, the proposed algorithm can be implemented without knowledge of the underlying query noise distribution, making it robust to any misspecification of these noise parameters. We demonstrate the performance of the proposed algorithm through experiments on computer networks, a toxic chemical database, and synthetic datasets.
Keywords :
belief networks; fault diagnosis; learning (artificial intelligence); maximum likelihood estimation; production engineering computing; query processing; MAP estimation; ROC curve; active diagnosis; belief propagation; binary valued query; computer network; disease diagnosis; fault diagnosis; greedy algorithm; information gain; maximum a posteriori estimation; noise parameter; posterior fault probability; query noise distribution; rank-based approach; receiver operating characteristic curve; synthetic dataset; toxic chemical database; Approximation methods; Computer networks; Diseases; Entropy; Fault diagnosis; Noise; Noise measurement; Active diagnosis; Bayesian network; active learning; area under the ROC curve; persistent noise; Algorithms; Area Under Curve; Artificial Intelligence; Bayes Theorem; Diagnosis, Computer-Assisted; ROC Curve;
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
DOI :
10.1109/TPAMI.2013.30