DocumentCode :
2710721
Title :
Text Cube: Computing IR Measures for Multidimensional Text Database Analysis
Author :
Lin, Cindy Xide ; Ding, Bolin ; Han, Jiawei ; Zhu, Feida ; Zhao, Bo
Author_Institution :
Dept. of Comput. Sci., Univ. of Illinois at Urbana-Champagin, Urbana, IL
fYear :
2008
fDate :
15-19 Dec. 2008
Firstpage :
905
Lastpage :
910
Abstract :
Since Jim Gray introduced the concept of rdquodata cuberdquo in 1997, data cube, associated with online analytical processing (OLAP), has become a driving engine in data warehouse industry. Because the boom of Internet has given rise to an ever increasing amount of text data associated with other multidimensional information, it is natural to propose a data cube model that integrates the power of traditional OLAP and IR techniques for text. In this paper, we propose a text-cube model on multidimensional text database and study effective OLAP over such data. Two kinds of hierarchies are distinguishable inside: dimensional hierarchy and term hierarchy. By incorporating these hierarchies, we conduct systematic studies on efficient text-cube implementation, OLAP execution and query processing. Our performance study shows the high promise of our methods.
Keywords :
data mining; data models; data warehouses; query processing; text analysis; IR measure; Internet; OLAP; data cube model; data warehouse industry; dimensional hierarchy; multidimensional text database analysis; online analytical processing; query processing; term hierarchy; text cube model; Cost function; Data analysis; Databases; Internet; Material storage; Multidimensional systems; Navigation; Optical computing; Power system modeling; Query processing; Cube; OLAP; Text;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2008. ICDM '08. Eighth IEEE International Conference on
Conference_Location :
Pisa
ISSN :
1550-4786
Print_ISBN :
978-0-7695-3502-9
Type :
conf
DOI :
10.1109/ICDM.2008.135
Filename :
4781199
Link To Document :
بازگشت