DocumentCode
2002101
Title
Applications of an Web information mining model to data mining and information retrieval tasks
Author
Pereira, Álvaro R., Jr. ; Baeza-Yates, Ricardo
Author_Institution
Dept. of Comput. Sci., Fed. Univ. of Minas Gerais, Belo Horizonte, Brazil
fYear
2005
fDate
22-26 Aug. 2005
Firstpage
1031
Lastpage
1035
Abstract
We have developed a model to mine information in applications involving graph analysis. We demonstrate the model characteristics using a Web warehouse, where nodes represent Web pages and edges represent hyperlinks. In this paper we apply the model to data mining and information retrieval tasks. Related to data mining, we present views for clustering Web nodes and for finding frequent itemsets for association rule mining. Related to information retrieval, we present views for performing a simple query, for clustering the query results and an attempt to improve the quality of the ranking. The use of the model with these purposes demonstrates its modularity, flexibility, applicability and broadness in graph problems.
Keywords
Internet; data mining; data warehouses; graph theory; information retrieval; pattern clustering; Web information mining; Web warehouse; association rule mining; data mining; frequent itemset; graph analysis; information retrieval; pattern clustering; Algebra; Application software; Association rules; Computer science; Data mining; Information analysis; Information retrieval; Itemsets; Object oriented modeling; Web pages;
fLanguage
English
Publisher
ieee
Conference_Titel
Database and Expert Systems Applications, 2005. Proceedings. Sixteenth International Workshop on
ISSN
1529-4188
Print_ISBN
0-7695-2424-9
Type
conf
DOI
10.1109/DEXA.2005.52
Filename
1508410
Link To Document