Title :
Compositional Generative Mapping for Tree-Structured Data—Part I: Bottom-Up Probabilistic Modeling of Trees
Author :
Bacciu, Davide ; Micheli, Andrea ; Sperduti, Alessandro
Author_Institution :
Dipt. di Inf., Univ. di Pisa, Pisa, Italy
Abstract :
We introduce a novel compositional (recursive) probabilistic model for trees that defines an approximated bottom-up generative process from the leaves to the root of a tree. The proposed model defines contextual state transitions from the joint configuration of the children to the parent nodes. We argue that the bottom-up context postulates different probabilistic assumptions with respect to a top-down approach, leading to different representational capabilities. We discuss classes of applications that are best suited to a bottom-up approach. In particular, the bottom-up context is shown to better correlate and model the co-occurrence of substructures among the child subtrees of internal nodes. A mixed memory approximation is introduced to factorize the joint children-to-parent state transition matrix as a mixture of pairwise transitions. The proposed approach is the first practical bottom-up generative model for tree-structured data that maintains the same computational class of its top-down counterpart. Comparative experimental analyses exploiting synthetic and real-world datasets show that the proposed model can deal with deep structures better than a top-down generative model. The model is also shown to better capture structural information from real-world data comprising trees with a large out-degree. The proposed bottom-up model can be used as a fundamental building block for the development of other new powerful models.
Keywords :
approximation theory; directed graphs; matrix algebra; tree data structures; approximated bottom-up generative process; bottom-up probabilistic modeling; child subtrees; compositional generative mapping; compositional probabilistic model; contextual state transitions; internal nodes; joint children-to-parent state transition matrix; mixed memory approximation; practical bottom-up generative model; probabilistic assumptions; real-world datasets; top-down generative model; tree out-degree; tree-structured data; Approximation methods; Computational modeling; Data models; Hidden Markov models; Probabilistic logic; Tree data structures; Bottom-up processing; hidden recursive model; hidden tree Markov model; tree-structured data;
Journal_Title :
Neural Networks and Learning Systems, IEEE Transactions on
DOI :
10.1109/TNNLS.2012.2222044