• DocumentCode
    2050492
  • Title
    An effective data preprocessing method for Web Usage Mining
  • Author
    Sudheer Reddy, K. ; Kantha Reddy, M. ; Sitaramulu, V.
  • Author_Institution
    Dept. of CSE, Acharya Nagarjuna Univ., Guntur, India
  • fYear
    2013
  • fDate
    21-22 Feb. 2013
  • Firstpage
    7
  • Lastpage
    10
  • Abstract
    Web Usage Mining (WUM) is a category of data mining techniques that identifies usage patterns in web data, so as to understand and better serve the requirements of web applications. WUM involves three steps: preprocessing, pattern discovery, and analysis. Preprocessing, the first step, is an essential activity that improves the quality of the data and, in turn, the mining results. This paper studies and presents several techniques for preparing the access stream before the mining process starts; these are used to improve the performance of data preprocessing in identifying unique sessions and unique users. The proposed methods help to discover meaningful patterns and relationships in the user's access stream, and various research tests show them to be valid and useful. The paper concludes by proposing future research directions in this area.
  • Keywords
    Internet; data mining; WUM; Web applications; Web data; Web usage mining; analysis step; data mining technique; data preparation techniques; data preprocessing method; data quality improvement; pattern discovery step; preprocessing step; usage pattern identification; Cascading style sheets; Cleaning; Data mining; Data preprocessing; IP networks; Web servers; Data Preprocessing; Path completion; User Session; Web Usage Mining; Web log;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Information Communication and Embedded Systems (ICICES), 2013 International Conference on
  • Conference_Location
    Chennai
  • Print_ISBN
    978-1-4673-5786-9
  • Type
    conf
  • DOI
    10.1109/ICICES.2013.6508197
  • Filename
    6508197