Title :
Improving Personalization Solutions through Optimal Segmentation of Customer Bases
Author :
Jiang, Tianyi ; Tuzhilin, Alexander
Author_Institution :
AvePoint Inc., Jersey, NJ
fDate :
3/1/2009 12:00:00 AM
Abstract :
On the Web, where the search costs are low and the competition is just a mouse click away, it is crucial to segment the customers intelligently in order to offer more targeted and personalized products and services to them. Traditionally, customer segmentation is achieved using statistics-based methods that compute a set of statistics from the customer data and group customers into segments by applying distance-based clustering algorithms in the space of these statistics. In this paper, we present a direct grouping-based approach to computing customer segments that groups customers not based on computed statistics, but in terms of optimally combining transactional data of several customers to build a data mining model of customer behavior for each group. Then, building customer segments becomes a combinatorial optimization problem of finding the best partitioning of the customer base into disjoint groups. This paper shows that finding an optimal customer partition is NP-hard, proposes several suboptimal direct grouping segmentation methods, and empirically compares them among themselves, traditional statistics-based hierarchical and affinity propagation-based segmentation, and one-to-one methods across multiple experimental conditions. It is shown that the best direct grouping method significantly dominates the statistics-based and one-to-one approaches across most of the experimental conditions, while still being computationally tractable. It is also shown that the distribution of the sizes of customer segments generated by the best direct grouping method follows a power law distribution and that microsegmentation provides the best approach to personalization.
Keywords :
combinatorial mathematics; customer services; data mining; electronic commerce; optimisation; pattern clustering; statistical analysis; NP-hard problem; affinity propagation-based segmentation; combinatorial optimization problem; customer bases; customer behavior; customer segmentation; data mining model; direct grouping segmentation methods; direct grouping-based approach; distance-based clustering algorithms; one-to-one methods; optimal segmentation; personalization solutions; power law distribution; statistics-based methods; transactional data; Clustering; Data mining; Personalization; and association rules; classification;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
DOI :
10.1109/TKDE.2008.163