  • DocumentCode
    1766523
  • Title
    Active Learning in Context-Driven Stream Mining With an Application to Image Mining
  • Author
    Tekin, Cem ; Van der Schaar, Mihaela
  • Author_Institution
    Dept. of Electr. Eng., Univ. of California at San Diego, La Jolla, CA, USA
  • Volume
    24
  • Issue
    11
  • fYear
    2015
  • fDate
    Nov. 2015
  • Firstpage
    3666
  • Lastpage
    3679
  • Abstract
    We propose an image stream mining method in which images arrive with contexts (metadata) and need to be processed in real time by the image mining system (IMS), which needs to make predictions and derive actionable intelligence from these streams. After extracting the features of the image by preprocessing, IMS determines online the classifier to use on the extracted features to make a prediction using the context of the image. A key challenge associated with stream mining is that the prediction accuracy of the classifiers is unknown, since the image source is unknown; thus, these accuracies need to be learned online. Another key challenge of stream mining is that learning can only be done by observing the true label, but this is costly to obtain. To address these challenges, we model the image stream mining problem as an active, online contextual experts problem, where the context of the image is used to guide the classifier selection decision. We develop an active learning algorithm and show that it achieves regret sublinear in the number of images that have been observed so far. To further illustrate and assess the performance of our proposed methods, we apply them to diagnose breast cancer from the images of cellular samples obtained from the fine needle aspirate of breast mass. Our findings show that very high diagnosis accuracy can be achieved by actively obtaining only a small fraction of true labels through surgical biopsies. Other applications include video surveillance and video traffic monitoring.
  • Keywords
    data mining; feature extraction; image classification; learning (artificial intelligence); pattern classification; prediction theory; IMS; active learning algorithm; breast cancer diagnosis; classifier selection decision; context-driven stream mining; feature extraction; image mining system; image stream mining method; prediction theory; surgical biopsies; video surveillance; video traffic monitoring; Accuracy; Algorithm design and analysis; Breast cancer; Context; Feature extraction; Prediction algorithms; Streaming media; Image stream mining; active learning; breast cancer diagnosis; contextual experts; online classification; online learning;
  • fLanguage
    English
  • Journal_Title
    Image Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1057-7149
  • Type
    jour
  • DOI
    10.1109/TIP.2015.2446936
  • Filename
    7126997