DocumentCode :
730332
Title :
Online time-dependent clustering using probabilistic topic models
Author :
Renard, Benjamin ; Kharratzadeh, Milad ; Coates, Mark
Author_Institution :
Dept. of Electr. & Comput. Eng., McGill Univ., Montreal, QC, Canada
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
2036
Lastpage :
2040
Abstract :
We introduce an online, time-dependent clustering algorithm that employs a dynamic probabilistic topic model. The proposed algorithm can handle data that evolves over time and strives to capture the evolution of clusters in the dataset. It addresses the case where the entire dataset is not available at once (e.g., the case of data streams) but an up-to-date clustering of the data at any given time is required. One of the main challenges of the data stream setting is that the computational cost and memory overhead must stay bounded as the number of data points increases. Our proposed algorithm has a Dirichlet process-based generative component combined with a sequential Monte Carlo sampler for posterior inference. We also introduce a novel modification to the sampling process, called targeted sampling, which enhances the performance of the SMC sampler. We test the performance of our algorithm with both synthetic and real datasets.
Keywords :
Monte Carlo methods; pattern clustering; Dirichlet process-based generative component; SMC sampler; dynamic probabilistic topic model; online time-dependent clustering algorithm; posterior inference; sequential Monte Carlo sampler; targeted sampling; Clustering algorithms; Computational modeling; Data models; Heuristic algorithms; Indexes; Mixture models; Monte Carlo methods; Clustering; Dirichlet Process; Sequential Monte Carlo Sampling; Targeted Sampling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7178328
Filename :
7178328
Link To Document :
بازگشت