Title :
Document warehousing: a document-intensive application of a multimedia database
Author :
Ishikawa, Hiroshi ; Ohta, Manabu ; Kato, Koki
Author_Institution :
Tokyo Metropolitan Univ., Japan
Abstract :
Nowadays, structured data such as sales are stored in data warehouses for decision-making. Less-structured data such as HTML texts, XML data, images, and videos are increasingly accumulated in PC storage due to the spread of the Internet technology such as WWW. Such less-structured data, collectively called multimedia documents, are also precious as corporate assets. So we need to provide a document warehouse to analyze and manage multimedia documents for corporate-wide information mining and reuse like a data warehouse. As a document-intensive application of a multimedia database, we describe a prototype document warehouse system, which supports management of documents, keyword-based and content-based retrieval, rule-based classification, SOM-based clustering and XML active query facility based on ECA rules
Keywords :
Internet; content-based retrieval; data mining; data warehouses; document handling; hypermedia markup languages; multimedia databases; visual databases; ECA rules; HTML; Internet; SOM-based clustering; World Wide Web; XML; active query facility; content-based retrieval; corporate-wide information mining; data warehouses; document management; document warehouse; image database; keyword-based retrieval; multimedia database; rule-based classification; structured data; video database; Data warehouses; Decision making; HTML; Image storage; Internet; Marketing and sales; Multimedia databases; Videos; Warehousing; XML;
Conference_Titel :
Research Issues in Data Engineering, 2001. Proceedings. Eleventh International Workshop on
Conference_Location :
Heidelberg
Print_ISBN :
0-7695-0957-6
DOI :
10.1109/RIDE.2001.916488