Title :
Tensor Space Learning for Analyzing Activity Patterns from Video Sequences
Author :
Wang, Liang ; Leckie, Christopher ; Wang, Xiaozhe ; Kotagiri, Ramamohanarao ; Bezdek, and Jim
Abstract :
Statistical topic models such as the Latent Dirichlet Allocation (LDA) have emerged as an attractive framework to model, visualize and summarize large document collections in a completely unsupervised fashion. Considering the enormous sizes of the modern electronic document collections, it is very important that these models are fast and scalable. In this work, we build parallel implementations of the variational EM algorithm for LDA in a multiprocessor architecture as well as a distributed setting. Our experiments on various sized document collections indicate that while both the implementations achieve speed-ups, the distributed version achieves dramatic improvements in both speed and scalability. We also analyze the costs associated with various stages of the EM algorithm and suggest ways to further improve the performance.
Keywords :
abstracting; data visualisation; document handling; expectation-maximisation algorithm; multiprocessing systems; parallel algorithms; variational techniques; vocabulary; electronic document collection summarization; electronic document collection visualization; latent Dirichlet allocation; multiprocessor architecture; parallelized variational EM algorithm; statistical topic model; Computer science; Data mining; Humans; Motion analysis; Motion detection; Motion measurement; Pattern analysis; Pixel; Tensile stress; Video sequences;
Conference_Titel :
Data Mining Workshops, 2007. ICDM Workshops 2007. Seventh IEEE International Conference on
Conference_Location :
Omaha, NE
Print_ISBN :
978-0-7695-3019-2
Electronic_ISBN :
978-0-7695-3033-8
DOI :
10.1109/ICDMW.2007.70