Title :
Heterogeneous Data Mining in Search Advertisement Click Rates
Author :
Liping, Zhu ; Lianen, Ji ; Wensheng, Guo
Abstract :
Search advertising is typical heterogeneous multi-dimensional data sets whose click-through rate depends on query words, terms and other factors. Traditional data mining methods, limited to homogenous data source, represent search ads as the vector space model, so they fail to sufficiently consider the search advertisements´ characteristics of heterogeneous data. This paper presents consistent bipartite graph model to describe ads, adopting spectral co-clustering method in data mining. In order to solve the balance partition of the map in clustering, heuristic algorithm is introduced into consistent bipartite graph´s co-partition; a more effective subgraph redistribution algorithm is established. Experiments on real ads dataset shows that our approach worked effectively and efficiently in both clustering and prediction.
Keywords :
advertising data processing; data mining; graph theory; pattern clustering; bipartite graph model; heterogeneous data mining; search advertisement click rates; spectral coclustering method; subgraph redistribution algorithm; vector space model; Advertising; Bipartite graph; Clustering algorithms; Data mining; Heuristic algorithms; Information systems; Multidimensional systems; Partitioning algorithms; Predictive models; Sparse matrices; CTR; Data Mining; Spectral Clustering;
Conference_Titel :
Web Information Systems and Mining, 2009. WISM 2009. International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-0-7695-3817-4
DOI :
10.1109/WISM.2009.34