• DocumentCode
    52594
  • Title

    Evolutionary Bayesian Rose Trees

  • Author

    Shixia Liu ; Xiting Wang ; Yangqiu Song ; Baining Guo

  • Author_Institution
    Sch. of Software, Tsinghua Univ., Beijing, China
  • Volume
    27
  • Issue
    6
  • fYear
    2015
  • fDate
    June 1 2015
  • Firstpage
    1533
  • Lastpage
    1546
  • Abstract
    We present an evolutionary multi-branch tree clustering method to model hierarchical topics and their evolutionary patterns overtime. The method builds evolutionary trees in a Bayesian online filtering framework. The tree construction is formulated as an online posterior estimation problem, which well balances both the fitness of the current tree and the smoothness between trees. The state-of-the-art multi-branch clustering method, Bayesian rose trees, is employed to generate a topic tree with a high fitness value. A constraint model is also introduced to preserve the smoothness between trees. A set of comprehensive experiments on real world news data demonstrates that the proposed method better incorporates historical tree information and is more efficient and effective than the traditional evolutionary hierarchical clustering algorithm. In contrast to our previous method [31], we implement two additional baseline algorithms to compare them with our algorithm. We also evaluate the performance of the clustering algorithm based on multiple constraint trees. Furthermore, two case studies are conducted to demonstrate the effectiveness and usefulness of our algorithm in helping users understand the major hierarchical topic evolutionary patterns in text data.
  • Keywords
    evolutionary computation; pattern clustering; trees (mathematics); Bayesian online filtering framework; constraint tree; evolutionary Bayesian rose trees; evolutionary hierarchical clustering algorithm; evolutionary multibranch tree clustering method; evolutionary pattern; hierarchical topic evolutionary pattern; hierarchical topics; multibranch clustering method; online posterior estimation problem; Bayes methods; Binary trees; Clustering algorithms; Electronic mail; Fans; Merging; Vegetation;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2014.2373384
  • Filename
    6964811