DocumentCode :
2182834
Title :
DTD-Miner: a tool for mining DTD from XML documents
Author :
Moh, Chuang-Hue ; Lim, Ee-Peng ; Ng, Wee-Keong
Author_Institution :
Center for Adv. Inf. Syst., Nanyang Technol. Inst., Singapore
fYear :
2000
fDate :
2000
Firstpage :
144
Lastpage :
151
Abstract :
XML documents are semi-structured and the structure of the documents is embedded in the tags. Although XML documents can be accompanied by a document type definition (DTD) that defines the structure of the documents, the presence of a DTD is not mandatory. The difficulty in deriving the DTD for XML documents lies in the fact that DTDs are of a different syntax from XML and that prior knowledge of the structure of the documents is required. In this paper, we introduce DTD-Miner, an automatic structure mining tool for XML documents. Using a Web-based interface, the user is able to submit a set of similarly structured XML documents and the system automatically suggests a DTD. The user is also able to further refine the DTD generated to reduce the complexity by relaxing some the rules used in the system
Keywords :
data mining; data structures; hypermedia markup languages; information resources; DTD refinement; DTD-Miner; World Wide Web-based interface; document structure mining tool; document type definition; prior knowledge; rule relaxation; similarly structured XML documents; syntax; tags; Banking; Electrical capacitance tomography; HTML; Information systems; Proposals; Web pages; World Wide Web; XML;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Issues of E-Commerce and Web-Based Information Systems, 2000. WECWIS 2000. Second International Workshop on
Conference_Location :
Milpitas, CA
Print_ISBN :
0-7695-0610-0
Type :
conf
DOI :
10.1109/WECWIS.2000.853869
Filename :
853869
Link To Document :
بازگشت