Title :
Applications of an Web information mining model to data mining and information retrieval tasks
Author :
Pereira, Álvaro R., Jr. ; Baeza-Yates, Ricardo
Author_Institution :
Dept. of Comput. Sci., Fed. Univ. of Minas Gerais, Belo Horizonte, Brazil
Abstract :
We have developed a model to mine information in applications involving graph analysis. We demonstrate the model characteristics using a Web warehouse, where nodes represent Web pages and edges represent hyperlinks. In this paper we apply the model to data mining and information retrieval tasks. Related to data mining, we present views for clustering Web nodes and for finding frequent itemsets for association rule mining. Related to information retrieval, we present views for performing a simple query, for clustering the query results and an attempt to improve the quality of the ranking. The use of the model with these purposes demonstrates its modularity, flexibility, applicability and broadness in graph problems.
Keywords :
Internet; data mining; data warehouses; graph theory; information retrieval; pattern clustering; Web information mining; Web warehouse; association rule mining; data mining; frequent itemset; graph analysis; information retrieval; pattern clustering; Algebra; Application software; Association rules; Computer science; Data mining; Information analysis; Information retrieval; Itemsets; Object oriented modeling; Web pages;
Conference_Titel :
Database and Expert Systems Applications, 2005. Proceedings. Sixteenth International Workshop on
Print_ISBN :
0-7695-2424-9
DOI :
10.1109/DEXA.2005.52