• DocumentCode
    3537044
  • Title

    Alternative Approach to Tree-Structured Web Log Representation and Mining

  • Author

    Hadzic, Fedja ; Hecker, Michael

  • Author_Institution
    DEBII, Curtin Univ., Perth, WA, Australia
  • Volume
    1
  • fYear
    2011
  • fDate
    22-27 Aug. 2011
  • Firstpage
    235
  • Lastpage
    242
  • Abstract
    More recent approaches to web log data representation aim to capture the user navigational patterns with respect to the overall structure of the web site. One such representation is tree-structured log files which is the focus of this work. Most existing methods for analyzing such data are based on the use of frequent sub tree mining techniques to extract frequent user activity and navigational paths. In this paper we evaluate the use of other standard data mining techniques enabled by a recently proposed structure preserving flat data representation for tree-structured data. The initially proposed framework was adjusted to better suit the web log mining task. Experimental evaluation is performed on two real world web log datasets and comparisons are made with an existing state-of-the-art classifier for tree-structured data. The results show the great potential of the method in enabling the application of a wider range of data mining/analysis techniques to tree-structured web log data.
  • Keywords
    Web sites; data mining; tree data structures; Web log data representation; Web log mining task; Web site; data mining; frequent sub tree mining techniques; tree-structured log files; Data mining; Encoding; Itemsets; Navigation; Web pages; tree-structured web logs; web usage mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence and Intelligent Agent Technology (WI-IAT), 2011 IEEE/WIC/ACM International Conference on
  • Conference_Location
    Lyon
  • Print_ISBN
    978-1-4577-1373-6
  • Electronic_ISBN
    978-0-7695-4513-4
  • Type

    conf

  • DOI
    10.1109/WI-IAT.2011.156
  • Filename
    6036753