DocumentCode :
3142263
Title :
Latent topic estimation based on events in a document
Author :
Kitajima, Risa ; Kobayashi, Ichiro
Author_Institution :
Grad. Sch. of Humanities & Sci., Ochanomizu Univ., Tokyo, Japan
fYear :
2011
fDate :
27-29 Nov. 2011
Firstpage :
289
Lastpage :
295
Abstract :
Recently, several latent topic model-based methods such as LSI, pLSI, and LDA have been widely used for text analysis. However, those methods basically assign topics to words, and therefore the relationship between words in a document is not considered. Considering this, we propose a latent topic extraction method which assigns topics to events that represent the relation between words in a document. There are several ways to express events, and the accuracy of estimating latent topics differs depending on the definition of an event. Therefore, we propose several event types and examine which event type works well to estimate latent topics in a document with a common document retrieval task. Furthermore, as an application of our proposed method, we also show a multi-document summarization based on latent topics.
Keywords :
document handling; information retrieval; document retrieval task; event definition; latent topic estimation; latent topic extraction method; multidocument summarization; text analysis; Accuracy;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Language Processing andKnowledge Engineering (NLP-KE), 2011 7th International Conference on
Conference_Location :
Tokushima
Print_ISBN :
978-1-61284-729-0
Type :
conf
DOI :
10.1109/NLPKE.2011.6138211
Filename :
6138211
Link To Document :
بازگشت