DocumentCode :
2772357
Title :
Finding Maximal Fully-Correlated Itemsets in Large Databases
Author :
Duan, Lian ; Street, W. Nick
Author_Institution :
Dept. of Manage. Sci., Univ. of Iowa, Iowa City, IA, USA
fYear :
2009
fDate :
6-9 Dec. 2009
Firstpage :
770
Lastpage :
775
Abstract :
Finding the most interesting correlations among items is essential for problems in many commercial, medical, and scientific domains. Much previous research focuses on finding correlated pairs instead of correlated itemsets in which all items are correlated with each other. When designing gift sets, store shelf arrangements, or Website product categories, we are more interested in correlated itemsets than correlated pairs. We solve this problem by finding maximal fully-correlated itemsets (MFCIs), in which all subsets are closely related to all other subsets. Putting the items in an MFCI together can promote sales within this itemset. Though some exsiting methods find high-correlation itemsets, they suffer from both efficiency and effectiveness problems in large datasets. In this paper, we explore high-dimensional correlation in two ways. First, we expand the set of desirable properties for correlation measures and study the advantages and disadvantages of various measures. Second, we propose an MFCI framework to decouple the correlation measure from the need for efficient search. By wrapping the best measure in our MFCI framework, we take advantage of likelihood ratio´s superiority in evaluating itemsets, make use of the properties of MFCI to eliminate itemsets with irrelevant items, and still achieve good computational performance.
Keywords :
correlation methods; data mining; database management systems; correlated pairs; correlation measure; large databases; maximal fully-correlated itemsets; Cities and towns; Conference management; Contracts; Data mining; Databases; Itemsets; Marketing and sales; USA Councils; Web page design; Wrapping;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2009. ICDM '09. Ninth IEEE International Conference on
Conference_Location :
Miami, FL
ISSN :
1550-4786
Print_ISBN :
978-1-4244-5242-2
Electronic_ISBN :
1550-4786
Type :
conf
DOI :
10.1109/ICDM.2009.89
Filename :
5360309
Link To Document :
بازگشت