DocumentCode
658372
Title
Tri-layer-cluster Generation Model for Activity Prediction
Author
Dandan Zhu ; Fukazawa, Yoshiaki ; Ota, Jun
Author_Institution
Res. into Artifacts, Center for Eng. (RACE), Univ. of Tokyo, Chiba, Japan
Volume
1
fYear
2013
fDate
17-20 Nov. 2013
Firstpage
359
Lastpage
366
Abstract
We propose a topic model capable of generating tri-layer clusters, each of which is composed of a topic layer, an activity layer and a word layer. The objective is to better predict activities involved in documents by considering general topics of the activities for clustering. The proposed model is a supervised topic model based on the Latent Dirichlet Allocation (LDA). As a follow-up study of word-pair generation LDA (wpLDA) model, the model introduces the topic-specific activity distribution as an external input, with an activity node inserted into the main generation thread. In addition, we refer to D. Ramage et al.´s one-to-one correspondence to directly learn word-activity tags. An experiment was conducted to prove the feasibility of this model. We chose ten top-listed activities from the wish clusters obtained by the previous wpLDA research, and used each as the key words to extract thirty tweets for training and five for testing, respectively, tagging the tweets with the corresponding activities. By applying the proposed model, we obtained the expected tri-layer clusters in the training phase. Then, in the testing phase, we utilized the activity-specific word distribution derived from the training results to learn the activities of the testing documents. The Stanford Classifier was put forward as the control group, and the activity prediction accuracy demonstrates that the proposed model exhibits the superiority in multi-activity prediction.
Keywords
document handling; pattern classification; pattern clustering; Stanford classifier; activity layer; activity prediction; latent Dirichlet allocation; supervised topic model; topic layer; topic-specific activity distribution; tri-layer-cluster generation model; word layer; word-pair generation LDA model; Accuracy; Computational modeling; Conferences; Electronic mail; Predictive models; Testing; Training; LDA model; activity prediction; tri-layer cluster; wpLDA;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2013 IEEE/WIC/ACM International Joint Conferences on
Conference_Location
Atlanta, GA
Print_ISBN
978-1-4799-2902-3
Type
conf
DOI
10.1109/WI-IAT.2013.51
Filename
6690037
Link To Document