Title :
An OLAP data model driven approach to process statistical tables
Author :
Luk, Wo-Shun ; Leung, Philip
Author_Institution :
Sch. of Comput. Sci., Simon Fraser Univ., Burnaby, BC, Canada
Abstract :
Statistical tables belong to an important subset of tables published in the Web, because they represent up-to-date, vital information sources for decision makers. These tables are often carefully designed for easy reading by analysts, and then mechanically produced by an OLAP database system. The general practice of extracting attribute-value pairs from statistical tables does not ensure high accuracy when they are used as a database for an information retrieval system. In this paper, we show how a human may visualize a statistical table as an multidimensional object, defined by a suitably modified OLAP model. In this way, the keywords are classified into semantically distinct groups, i.e., dimension hierarchies, without any ontological knowledge or resorting to machine learning. A prototype system which mimics the human reasoning for table processing has been implemented. Experiments on 150 randomly chosen tables from statistics Canada have confirmed the validity of this approach.
Keywords :
Internet; data mining; data models; query processing; Internet; OLAP database system; attribute-value pair; data model; information retrieval system; statistical table; Data mining; Data models; Database systems; Humans; Information retrieval; Machine learning; Multidimensional systems; Ontologies; Visual databases; Visualization;
Conference_Titel :
Database and Expert Systems Applications, 2005. Proceedings. Sixteenth International Workshop on
Print_ISBN :
0-7695-2424-9
DOI :
10.1109/DEXA.2005.144