• DocumentCode
    21744
  • Title

    Visual Methods for Analyzing Probabilistic Classification Data

  • Author

    Alsallakh, Bilal ; Hanbury, Allan ; Hauser, Helwig ; Miksch, Silvia ; Rauber, Andreas

  • Author_Institution
    Vienna Univ. of Technol., Vienna, Austria
  • Volume
    20
  • Issue
    12
  • fYear
    2014
  • fDate
    Dec. 31 2014
  • Firstpage
    1703
  • Lastpage
    1712
  • Abstract
    Multi-class classifiers often compute scores for the classification samples describing probabilities to belong to different classes. In order to improve the performance of such classifiers, machine learning experts need to analyze classification results for a large number of labeled samples to find possible reasons for incorrect classification. Confusion matrices are widely used for this purpose. However, they provide no information about classification scores and features computed for the samples. We propose a set of integrated visual methods for analyzing the performance of probabilistic classifiers. Our methods provide insight into different aspects of the classification results for a large number of samples. One visualization emphasizes at which probabilities these samples were classified and how these probabilities correlate with classification error in terms of false positives and false negatives. Another view emphasizes the features of these samples and ranks them by their separation power between selected true and false classifications. We demonstrate the insight gained using our technique in a benchmarking classification dataset, and show how it enables improving classification performance by interactively defining and evaluating post-classification rules.
  • Keywords
    data handling; learning (artificial intelligence); matrix algebra; pattern classification; probability; benchmarking classification dataset; classification samples; confusion matrices; integrated visual methods; machine learning; multiclass classifiers; probabilistic classification data analysis; visual methods; Data visualization; Electric breakdown; Histograms; Image color analysis; Probabilistic logic; Probability; Probabilistic classification; confusion analysis; feature evaluation and selection; visual inspection;
  • fLanguage
    English
  • Journal_Title
    Visualization and Computer Graphics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1077-2626
  • Type

    jour

  • DOI
    10.1109/TVCG.2014.2346660
  • Filename
    6875957