Title :
Minimum tree edit distance between XML and Probabilistic XML documents
Author :
Haitao Ma ; Changming Xu ; Miao Fang ; Yu Changyong
Author_Institution :
Coll. of Inf. Sci. & Eng., Northeastern Univ. Shenyang, Shenyang, China
Abstract :
A probabilistic XML document is a data model that represents data as a probability distribution over ordinary XML documents, which are called possible worlds. This paper studies the tree edit distance between an XML document and a probabilistic XML document. In particular, we define the minimum tree edit distance between an XML document and the possible worlds of a probabilistic XML document. We investigate the problem and propose an algorithm for computing this minimum tree edit distance whose runtime is polynomial in the size of the probabilistic XML document. Finally, our experimental evaluation on synthetic XML documents confirms our analytical results.
Keywords :
XML; data models; probability; XML document; minimum tree edit distance; polynomial runtime; probabilistic XML document; probabilistic distribution; tree edit distance; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Computational modeling; Probabilistic logic; Programming; XML documents; probabilistic XML
Conference_Title :
Electronics, Computer and Applications, 2014 IEEE Workshop on
Conference_Location :
Ottawa, ON
DOI :
10.1109/IWECA.2014.6845639