DocumentCode :
2002166
Title :
An OLAP data model driven approach to process statistical tables
Author :
Luk, Wo-Shun ; Leung, Philip
Author_Institution :
Sch. of Comput. Sci., Simon Fraser Univ., Burnaby, BC, Canada
fYear :
2005
fDate :
22-26 Aug. 2005
Firstpage :
1054
Lastpage :
1058
Abstract :
Statistical tables belong to an important subset of tables published in the Web, because they represent up-to-date, vital information sources for decision makers. These tables are often carefully designed for easy reading by analysts, and then mechanically produced by an OLAP database system. The general practice of extracting attribute-value pairs from statistical tables does not ensure high accuracy when they are used as a database for an information retrieval system. In this paper, we show how a human may visualize a statistical table as an multidimensional object, defined by a suitably modified OLAP model. In this way, the keywords are classified into semantically distinct groups, i.e., dimension hierarchies, without any ontological knowledge or resorting to machine learning. A prototype system which mimics the human reasoning for table processing has been implemented. Experiments on 150 randomly chosen tables from statistics Canada have confirmed the validity of this approach.
Keywords :
Internet; data mining; data models; query processing; Internet; OLAP database system; attribute-value pair; data model; information retrieval system; statistical table; Data mining; Data models; Database systems; Humans; Information retrieval; Machine learning; Multidimensional systems; Ontologies; Visual databases; Visualization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Database and Expert Systems Applications, 2005. Proceedings. Sixteenth International Workshop on
ISSN :
1529-4188
Print_ISBN :
0-7695-2424-9
Type :
conf
DOI :
10.1109/DEXA.2005.144
Filename :
1508414
Link To Document :
بازگشت