DocumentCode :
2814827
Title :
Free-Traversing Syntactic and Semantic Comparison on Semi-Structured Languages
Author :
Moon, Hyun-Joo ; Yoo, Jae-Woo
Author_Institution :
Dept. Cultural Contents, Hankuk Univ. of Foreign Studies, Seoul
fYear :
2008
fDate :
28-30 Aug. 2008
Firstpage :
747
Lastpage :
752
Abstract :
Traversing trees for syntactic and semantic comparison is expensive in both time and space. The Internet, heterogeneous computing environments, and ubiquitous computing technologies have all driven explosive growth in Web data, most of which is written in semi-structured languages. As Web data usage grows and its management becomes more important, comparison techniques such as similarity detection are increasingly needed for efficient information and database management. This paper introduces a free-traversing technique that analyzes syntactic and semantic meaning without traversing the parse trees generated by the corresponding language parser. The technique uses the DIES (direct invariant encoding scheme) encoding method and produces results similar to a DFS (depth-first search) traversal of the parse tree. We evaluate the technique on XML schema DTDs. We adopt some ontological technologies and apply the LCS (longest common string) and LNS (longest nesting common string) structure extraction methods. With this free-traversing technique, semi-structured Web data can be managed more easily and quickly than with existing tree-traversing methods.
Keywords :
XML; grammars; ontologies (artificial intelligence); programming language semantics; tree searching; XML schema DTD; depth first search; direct invariant encoding scheme; free traversing; language parser; longest nesting common string; ontological technology; parse trees; semantic meaning; semistructured Web data management; semistructured language; structure extraction; syntactic meaning; tree traversing; Databases; Encoding; Explosives; Information management; Internet; Ontologies; Pervasive computing; Space technology; Ubiquitous computing; XML; Document Comparison; Free-Traversing; Semi-structured Web Data; XML;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Convergence and Hybrid Information Technology, 2008. ICHIT '08. International Conference on
Conference_Location :
Daejeon
Print_ISBN :
978-0-7695-3328-5
Type :
conf
DOI :
10.1109/ICHIT.2008.241
Filename :
4622917