Title :
Active Learning in Context-Driven Stream Mining With an Application to Image Mining
Author :
Tekin, Cem ; Van der Schaar, Mihaela
Author_Institution :
Dept. of Electr. Eng., Univ. of California at San Diego, La Jolla, CA, USA
Abstract :
We propose an image stream mining method in which images arrive with contexts (metadata) and need to be processed in real time by the image mining system (IMS), which needs to make predictions and derive actionable intelligence from these streams. After extracting the features of the image by preprocessing, IMS determines online the classifier to use on the extracted features to make a prediction using the context of the image. A key challenge associated with stream mining is that the prediction accuracy of the classifiers is unknown, since the image source is unknown; thus, these accuracies need to be learned online. Another key challenge of stream mining is that learning can only be done by observing the true label, but this is costly to obtain. To address these challenges, we model the image stream mining problem as an active, online contextual experts problem, where the context of the image is used to guide the classifier selection decision. We develop an active learning algorithm and show that it achieves regret sublinear in the number of images that have been observed so far. To further illustrate and assess the performance of our proposed methods, we apply them to diagnose breast cancer from the images of cellular samples obtained from the fine needle aspirate of breast mass. Our findings show that very high diagnosis accuracy can be achieved by actively obtaining only a small fraction of true labels through surgical biopsies. Other applications include video surveillance and video traffic monitoring.
Keywords :
data mining; feature extraction; image classification; learning (artificial intelligence); pattern classification; prediction theory; IMS; active learning algorithm; breast cancer diagnosis; classifier selection decision; context-driven stream mining; feature extraction; image mining system; image stream mining method; prediction theory; surgical biopsies; video surveillance; video traffic monitoring; Accuracy; Algorithm design and analysis; Breast cancer; Context; Feature extraction; Prediction algorithms; Streaming media; Image stream mining; active learning; breast cancer diagnosis; contextual experts; online classification; online learning;
Journal_Title :
Image Processing, IEEE Transactions on
DOI :
10.1109/TIP.2015.2446936