DocumentCode :
2118835
Title :
Intuitive Topic Discovery by Incorporating Word-Pair´s Connection Into LDA
Author :
Dandan Zhu ; Fukazawa, Yoshiaki ; Karapetsas, E. ; Ota, Jun
Author_Institution :
Res. into Artifacts Center for Eng. (RACE), Univ. of Tokyo, Chiba, Japan
Volume :
1
fYear :
2012
fDate :
4-7 Dec. 2012
Firstpage :
303
Lastpage :
310
Abstract :
We demonstrate a generative model that incorporates word-pair connection into the smoothed LDA model to intuitively discover people´s wish related activities. The widely used model, LDA topic model, generally generates clusters in the form of separate words. However, this form is not intuitive enough to express people´s activities. Therefore, we consider the word-pairs led by verbs can better describe users´ intentions and activities, and we prefer to present this collocation under topics as the clustering results. We mathematically present the relatedness between verbs and non-verb words through association rule, and build the physical connection of word-pairs and possible topics. By incorporating the connection lattice into the smoothed LDA, the word-pair LDA model is created. In the experiments, Twitter posts about “new year´s resolutions” were chosen as the data source. The results show that the proposed model performs well on perplexity, and presents excellent intuitive character.
Keywords :
data mining; natural language processing; pattern clustering; social networking (online); word processing; Latent Dirichlet allocation; association rule; data source; generative model; intuitive character; intuitive topic discovery; new year resolutions; nonverb words; people wish related activities; smoothed LDA model; user activities; user intention; verb words; word-pair LDA topic model; word-pair connection; LDA model; association rules; connection lattice; intuitive expressions; twitter posts;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2012 IEEE/WIC/ACM International Conferences on
Conference_Location :
Macau
Print_ISBN :
978-1-4673-6057-9
Type :
conf
DOI :
10.1109/WI-IAT.2012.205
Filename :
6511901
Link To Document :
بازگشت