DocumentCode
3249060
Title
Using text mining to infer semantic attributes for retail data mining
Author
Ghani, Rayid ; Fano, Andrew E.
Author_Institution
Accenture Technol. Labs, Chicago, IL, USA
fYear
2002
fDate
2002
Firstpage
195
Lastpage
202
Abstract
Current data mining techniques usually do not have a mechanism to automatically infer semantic features inherent in the data being "mined". The semantics are either injected in the initial stages (by feature construction) or by interpreting the results produced by the algorithms. Both of these techniques have proved effective but require a lot of human effort. In many domains, semantic information is implicitly available and can be extracted automatically to improve data mining systems. In this paper we present a case study of a system that is trained to extract semantic features for apparel products and populate a knowledge base with these products and features. We show that semantic features of these items can be successfully extracted by applying text learning techniques to the descriptions obtained from websites of retailers. We also describe several applications of such a knowledge base of product semantics that we have built including recommender systems and competitive intelligence tools and provide evidence that our approach can successfully build a knowledge base with accurate facts which can then be used to create profiles of individual customers, groups of customers, or entire retail stores.
Keywords
data mining; inference mechanisms; knowledge based systems; retail data processing; text analysis; apparel products; byfeature construction; competitive intelligence tools; product semantics; recommender systems; retail data mining; retailer Web sites; retailer Websites; semantic attribute inference; semantic feature extraction; semantic features; text learning techniques; text mining; Association rules; Competitive intelligence; Data analysis; Data engineering; Data mining; Decision trees; Humans; Neural networks; Recommender systems; Text mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2002. ICDM 2003. Proceedings. 2002 IEEE International Conference on
Print_ISBN
0-7695-1754-4
Type
conf
DOI
10.1109/ICDM.2002.1183903
Filename
1183903
Link To Document