• DocumentCode
    244936
  • Title

    UnTangle: Visual Mining for Data with Uncertain Multi-labels via Triangle Map

  • Author

    Yu-Ru Lin ; Cao, Nianxia ; Gotz, David ; Lu Lu

  • Author_Institution
    Univ. of Pittsburgh, Pittsburgh, PA, USA
  • fYear
    2014
  • fDate
    14-17 Dec. 2014
  • Firstpage
    340
  • Lastpage
    349
  • Abstract
    Data with multiple uncertain labels are common in many situations. For examples, a movie may be associated with multiple genres with different levels of confidence, and a protein sequence may be probabilistically assigned to several structural subcategories. Despite their ubiquity, the problem of visualizing uncertain labels has not been adequately addressed. Existing approaches often either discard the uncertainty information, or map the data to a low-dimensional subspace where their associations with multiple labels are obscured. In this paper, we propose a novel visual mining technique, UnTangle, for visualizing uncertain multi-labels. In our proposed visualization, data items are placed inside a web of connected triangles, with labels assigned to the triangle vertices such that nearby labels are more relevant to each other. The positions of the data items are determined based on the probabilistic associations between items and labels. UnTangle provides both (a) an automatic label placement algorithm, and (b) adaptive interaction mechanisms that allow users to control the label positioning for different visual queries. Our work makes a unique contribution by providing an effective way to investigate the relationship between data items and their uncertain labels, as well as the relationships among labels. Our user study suggests that the visualization effectively helps users discover emergent patterns and compare the nuances of uncertainty information in the data labels.
  • Keywords
    data mining; data visualisation; UnTangle; adaptive interaction mechanisms; automatic label placement algorithm; data items; data mining; emergent pattern discovery; label positioning control; low-dimensional subspace; multiple genres; multiple uncertain labels; probabilistic associations; protein sequence; triangle map; uncertain multilabel visualization; uncertain multilabels; uncertainty information; visual mining; visual queries; visualizing uncertain labels; Data mining; Data visualization; Distributed databases; Motion pictures; Probabilistic logic; Uncertainty; Visualization; multi-labels; probablistic labels; ternary plot; uncertainty data; visual mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2014 IEEE International Conference on
  • Conference_Location
    Shenzhen
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4799-4303-6
  • Type

    conf

  • DOI
    10.1109/ICDM.2014.24
  • Filename
    7023351