• DocumentCode
    62438
  • Title

    An Approach to Supporting Incremental Visual Data Classification

  • Author

    Paiva, Jose Gustavo S. ; Schwartz, William Robson ; Pedrini, Helio ; Minghim, Rosane

  • Author_Institution
    Fac. of Comput. Sci., Fed. Univ. of Uberlandia-UFU, Uberlandia, Brazil
  • Volume
    21
  • Issue
    1
  • fYear
    2015
  • fDate
    Jan. 1 2015
  • Firstpage
    4
  • Lastpage
    17
  • Abstract
    Automatic data classification is a computationally intensive task that presents variable precision and is considerably sensitive to the classifier configuration and to data representation, particularly for evolving data sets. Some of these issues can best be handled by methods that support users´ control over the classification steps. In this paper, we propose a visual data classification methodology that supports users in tasks related to categorization such as training set selection; model creation, application and verification; and classifier tuning. The approach is then well suited for incremental classification, present in many applications with evolving data sets. Data set visualization is accomplished by means of point placement strategies, and we exemplify the method through multidimensional projections and Neighbor Joining trees. The same methodology can be employed by a user who wishes to create his or her own ground truth (or perspective) from a previously unlabeled data set. We validate the methodology through its application to categorization scenarios of image and text data sets, involving the creation, application, verification, and adjustment of classification models.
  • Keywords
    data visualisation; learning (artificial intelligence); pattern classification; application and verification; classification models; classifier configuration; classifier tuning; data representation; ground truth; incremental visual data classification; model creation; multidimensional projection; neighbor joining trees; training set selection; Computational modeling; Data models; Data visualization; Layout; Mathematical model; Training; Visualization; Visual image classification; information visualization; multidimensional point placement;
  • fLanguage
    English
  • Journal_Title
    Visualization and Computer Graphics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1077-2626
  • Type

    jour

  • DOI
    10.1109/TVCG.2014.2331979
  • Filename
    6840370