Title :
Category Mining by Heterogeneous Data Fusion Using PdLSI Model in a Retail Service
Author :
Ishigaki, Tsukasa ; Takenaka, Takeshi ; Motomura, Yoichi
Author_Institution :
Data Based Modeling Res. Team, Nat. Inst. of Adv. Ind. Sci. & Technol. (AIST), Tokyo, Japan
Abstract :
This paper describes an appropriate category discovery method that simultaneously involves a customer´s lifestyle category and item category for the sustainable management of retail services, designated as ``category mining´´. Category mining is realized using a large-scale ID-POS data and customer´s questionnaire responses with respect to their lifestyle. For the heterogeneous data fusion, we propose a probabilistic double-latent semantic indexing (PdLSI) model that is an extension of PLSI model. In the PdLSI model, customers and items are classified probabilistically into some latent lifestyle categories and latent item category. Then, understanding of relation between the latent categories and various purchased situations is realized using Bayesian network modeling. This method provides useful knowledge based on a large-scale data for efficient customer relationship management and category management, and can be applicable for other service industries.
Keywords :
belief networks; customer relationship management; data mining; probability; retail data processing; sensor fusion; service industries; Bayesian network; category discovery; category management; category mining; customer relationship management; heterogeneous data fusion; item classification; large scale ID POS data; latent item category; latent lifestyle category; probabilistic double latent semantic indexing method; retail service industries; Bayesian network; heterogeneous data fusion; large-scale ID-POS data; service engineering; topic model;
Conference_Titel :
Data Mining (ICDM), 2010 IEEE 10th International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-1-4244-9131-5
Electronic_ISBN :
1550-4786
DOI :
10.1109/ICDM.2010.83