• DocumentCode
    2334108
  • Title
    Integrating e-commerce and data mining: architecture and challenges
  • Author
    Ansari, Suhail ; Kohavi, Ron ; Mason, Llew ; Zheng, Zijian
  • Author_Institution
    Blue Martini Software, San Mateo, CA, USA
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    27
  • Lastpage
    34
  • Abstract
    We show that the e-commerce domain can provide all the right ingredients for successful data mining. We describe an integrated architecture for supporting this integration. The architecture can dramatically reduce the pre-processing, cleaning, and data understanding effort often documented to take 80% of the time in knowledge discovery projects. We emphasize the need for data collection at the application server layer (not the Web server) in order to support logging of data and metadata that is essential to the discovery process. We describe the data transformation bridges required from the transaction processing systems and customer event streams (e.g., clickstreams) to the data warehouse. We detail the mining workbench, which needs to provide multiple views of the data through reporting, data mining algorithms, visualization, and OLAP. We conclude with a set of challenges.
  • Keywords
    data mining; data visualisation; data warehouses; electronic commerce; information resources; meta data; transaction processing; OLAP; application server layer; clickstreams; customer event streams; data collection; data logging; data mining algorithms; data transformation bridges; data understanding; data warehouse; e-commerce domain; e-commerce/data mining integration; integrated architecture; knowledge discovery projects; metadata; mining workbench; multiple views; transaction processing systems; visualization; Bridges; Cleaning; Computer architecture; Data mining; Data visualization; Data warehouses; Fuels; User interfaces; Web pages; Web server
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference on
  • Conference_Location
    San Jose, CA
  • Print_ISBN
    0-7695-1119-8
  • Type
    conf
  • DOI
    10.1109/ICDM.2001.989497
  • Filename
    989497