• DocumentCode
    3206414
  • Title

    Improving onboard analysis of Hyperion images by filtering mislabeled training data examples

  • Author

    Rebbapragada, Umaa ; Mandrake, Lukas ; Wagstaff, Kiri L. ; Gleeson, Damhnait ; Castano, Rebecca ; Chien, Steve ; Brodley, Carla E.

  • Author_Institution
    Dept. of Comput. Sci., Tufts Univ., Medford, MA
  • fYear
    2009
  • fDate
    7-14 March 2009
  • Firstpage
    1
  • Lastpage
    9
  • Abstract
    This paper presents PWEM, a technique for detecting class label noise in training data. PWEM detects mislabeled examples by assigning to each training example a probability that its label is correct. PWEM calculates this probability by clustering examples from pairs of classes together and analyzing the distribution of labels within each cluster to derive the probability of each label´s correctness. We discuss how one can use the probabilities output by PWEM to filter, mitigate, or correct mislabeled training examples. We then provide an in-depth discussion of how we applied PWEM to a sulfur detector that labels pixels from Hyperion images of the Borup-Fiord pass in Northern Canada. PWEM assigned a large number of the sulfur training examples low probabilities, indicating severe mislabeling within the sulfur class. The filtering of those low confidence examples resulted in a cleaner training set and improved the median false positive rate of the classifier by at least 29%.
  • Keywords
    expectation-maximisation algorithm; filtering theory; geophysical signal processing; image processing; learning (artificial intelligence); noise; pattern clustering; probability; remote sensing; sulphur; support vector machines; Borup-Fiord pass; Hyperion images; Northern Canada; PWEM; class label noise detection; example clustering; expectation maximization; mislabeled training data filtering; onboard analysis; probability; sulfur detector; support vector machines; Clustering algorithms; Detectors; Filtering; Filters; Humans; Image analysis; Labeling; Pixel; Probability; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Aerospace conference, 2009 IEEE
  • Conference_Location
    Big Sky, MT
  • Print_ISBN
    978-1-4244-2621-8
  • Electronic_ISBN
    978-1-4244-2622-5
  • Type

    conf

  • DOI
    10.1109/AERO.2009.4839580
  • Filename
    4839580