Title :
Fast Kernel for Calculating Structural Information Similarities
Author :
Wei, Jin-Mao ; Wang, Shu-Qin ; Wang, Jing ; You, Jun-Ping
Author_Institution :
Inst. of Comput. Intelligence, Northeast Normal Univ., Changchun
Abstract :
Structural similarity computation plays a crucial role in many applications such as in searching similar documents, in comparing chemical compounds, in finding genetic similarities, etc. We propose in this paper to use structural information content (SIC) for measuring structural information, considering both the nodes and edges of trees. We utilize a binary encoding approach for assigning the weights of different layer nodes and determining if some tree is a subtree of another tree. By defining a fast kernel and recursively computing SICs, we evaluate the structural information similarities of data trees to pattern trees. In the paper, we present the algorithm for calculating SICs with computation complexity of O(n), and use simple examples to instantiate the performance of the proposed method
Keywords :
XML; computational complexity; content management; encoding; trees (mathematics); XML; binary encoding; computational complexity; data trees; kernel; pattern trees; sructural similarity computation; structural information content; structural pattern analysis; Chemical compounds; Clustering algorithms; Encoding; Genetics; Intelligent structures; Intelligent systems; Kernel; Pattern analysis; Silicon carbide; XML; Structural information content; structural pattern analysis; structural similarity; web mining;
Conference_Titel :
Intelligent Systems, 2006 3rd International IEEE Conference on
Conference_Location :
London
Print_ISBN :
1-4244-01996-8
Electronic_ISBN :
1-4244-01996-8
DOI :
10.1109/IS.2006.348394