Title :
Hot item mining and summarization from multiple auction Web sites
Author :
Wong, Tak-Lam ; Lam, Wai
Author_Institution :
Dept. of Syst. Eng. & Eng. Manage., The Chinese Univ. of Hong Kong, Shatin, China
Abstract :
Online auction Web sites are fast changing, highly dynamic, and complex as they involve tremendous sellers and potential buyers, as well as a huge amount of items listed for bidding. We develop a two-phase framework which aims at mining and summarizing hot items from multiple auction Web sites to assist decision making. The objective of the first phase is to automatically extract the product features and product feature values of the items from the descriptions provided by the sellers. We design a HMM-based learning method to train an extended HMM model which can adapt to the unseen Web page from which the information is extracted. The goal of the second phase is to discover and summarize the hot items based on the extracted information. We formulate the hot item mining task as a semi-supervised learning problem and employ the graph mincuts algorithm to accomplish this task. The summary of the hot items is then generated by considering the frequency and the position of the product features being mentioned in the descriptions. We have conducted extensive experiments from several real-world auction Web sites to demonstrate the effectiveness of our framework.
Keywords :
Web sites; data mining; electronic commerce; graph theory; hidden Markov models; learning (artificial intelligence); HMM-based learning; graph mincuts algorithm; hidden Markov model; hot item mining; hot item summarization; multiple auction Web sites; online auction Web sites; product feature extraction; semisupervised learning; Data mining; Decision making; Feature extraction; Hidden Markov models; Learning systems; Potential well; Research and development management; Semisupervised learning; Systems engineering and theory; Web pages;
Conference_Titel :
Data Mining, Fifth IEEE International Conference on
Print_ISBN :
0-7695-2278-5
DOI :
10.1109/ICDM.2005.78