DocumentCode
730332
Title
Online time-dependent clustering using probabilistic topic models
Author
Renard, Benjamin ; Kharratzadeh, Milad ; Coates, Mark
Author_Institution
Dept. of Electr. & Comput. Eng., McGill Univ., Montreal, QC, Canada
fYear
2015
fDate
19-24 April 2015
Firstpage
2036
Lastpage
2040
Abstract
We introduce an online, time-dependent clustering algorithm that employs a dynamic probabilistic topic model. The proposed algorithm can handle data that evolves over time and strives to capture the evolution of clusters in the dataset. It addresses the case where the entire dataset is not available at once (e.g., the case of data streams) but an up-to-date clustering of the data at any given time is required. One of the main challenges of the data stream setting is that the computational cost and memory overhead must stay bounded as the number of data points increases. Our proposed algorithm has a Dirichlet process-based generative component combined with a sequential Monte Carlo sampler for posterior inference. We also introduce a novel modification to the sampling process, called targeted sampling, which enhances the performance of the SMC sampler. We test the performance of our algorithm with both synthetic and real datasets.
Keywords
Monte Carlo methods; pattern clustering; Dirichlet process-based generative component; SMC sampler; dynamic probabilistic topic model; online time-dependent clustering algorithm; posterior inference; sequential Monte Carlo sampler; targeted sampling; Clustering algorithms; Computational modeling; Data models; Heuristic algorithms; Indexes; Mixture models; Monte Carlo methods; Clustering; Dirichlet Process; Sequential Monte Carlo Sampling; Targeted Sampling;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location
South Brisbane, QLD
Type
conf
DOI
10.1109/ICASSP.2015.7178328
Filename
7178328
Link To Document