• DocumentCode
    2002101
  • Title

    Applications of an Web information mining model to data mining and information retrieval tasks

  • Author

    Pereira, Álvaro R., Jr. ; Baeza-Yates, Ricardo

  • Author_Institution
    Dept. of Comput. Sci., Fed. Univ. of Minas Gerais, Belo Horizonte, Brazil
  • fYear
    2005
  • fDate
    22-26 Aug. 2005
  • Firstpage
    1031
  • Lastpage
    1035
  • Abstract
    We have developed a model to mine information in applications involving graph analysis. We demonstrate the model characteristics using a Web warehouse, where nodes represent Web pages and edges represent hyperlinks. In this paper we apply the model to data mining and information retrieval tasks. Related to data mining, we present views for clustering Web nodes and for finding frequent itemsets for association rule mining. Related to information retrieval, we present views for performing a simple query, for clustering the query results and an attempt to improve the quality of the ranking. The use of the model with these purposes demonstrates its modularity, flexibility, applicability and broadness in graph problems.
  • Keywords
    Internet; data mining; data warehouses; graph theory; information retrieval; pattern clustering; Web information mining; Web warehouse; association rule mining; data mining; frequent itemset; graph analysis; information retrieval; pattern clustering; Algebra; Application software; Association rules; Computer science; Data mining; Information analysis; Information retrieval; Itemsets; Object oriented modeling; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database and Expert Systems Applications, 2005. Proceedings. Sixteenth International Workshop on
  • ISSN
    1529-4188
  • Print_ISBN
    0-7695-2424-9
  • Type

    conf

  • DOI
    10.1109/DEXA.2005.52
  • Filename
    1508410