Title :
Tri-layer-cluster Generation Model for Activity Prediction
Author :
Dandan Zhu ; Fukazawa, Yoshiaki ; Ota, Jun
Author_Institution :
Res. into Artifacts, Center for Eng. (RACE), Univ. of Tokyo, Chiba, Japan
Abstract :
We propose a topic model capable of generating tri-layer clusters, each of which is composed of a topic layer, an activity layer and a word layer. The objective is to better predict activities involved in documents by considering general topics of the activities for clustering. The proposed model is a supervised topic model based on the Latent Dirichlet Allocation (LDA). As a follow-up study of word-pair generation LDA (wpLDA) model, the model introduces the topic-specific activity distribution as an external input, with an activity node inserted into the main generation thread. In addition, we refer to D. Ramage et al.´s one-to-one correspondence to directly learn word-activity tags. An experiment was conducted to prove the feasibility of this model. We chose ten top-listed activities from the wish clusters obtained by the previous wpLDA research, and used each as the key words to extract thirty tweets for training and five for testing, respectively, tagging the tweets with the corresponding activities. By applying the proposed model, we obtained the expected tri-layer clusters in the training phase. Then, in the testing phase, we utilized the activity-specific word distribution derived from the training results to learn the activities of the testing documents. The Stanford Classifier was put forward as the control group, and the activity prediction accuracy demonstrates that the proposed model exhibits the superiority in multi-activity prediction.
Keywords :
document handling; pattern classification; pattern clustering; Stanford classifier; activity layer; activity prediction; latent Dirichlet allocation; supervised topic model; topic layer; topic-specific activity distribution; tri-layer-cluster generation model; word layer; word-pair generation LDA model; Accuracy; Computational modeling; Conferences; Electronic mail; Predictive models; Testing; Training; LDA model; activity prediction; tri-layer cluster; wpLDA;
Conference_Titel :
Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2013 IEEE/WIC/ACM International Joint Conferences on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4799-2902-3
DOI :
10.1109/WI-IAT.2013.51