Title :
Latent topic estimation based on events in a document
Author :
Kitajima, Risa ; Kobayashi, Ichiro
Author_Institution :
Grad. Sch. of Humanities & Sci., Ochanomizu Univ., Tokyo, Japan
Abstract :
Recently, several latent topic model-based methods such as LSI, pLSI, and LDA have been widely used for text analysis. However, those methods basically assign topics to words, and therefore the relationship between words in a document is not considered. Considering this, we propose a latent topic extraction method which assigns topics to events that represent the relation between words in a document. There are several ways to express events, and the accuracy of estimating latent topics differs depending on the definition of an event. Therefore, we propose several event types and examine which event type works well to estimate latent topics in a document with a common document retrieval task. Furthermore, as an application of our proposed method, we also show a multi-document summarization based on latent topics.
Keywords :
document handling; information retrieval; document retrieval task; event definition; latent topic estimation; latent topic extraction method; multidocument summarization; text analysis; Accuracy;
Conference_Titel :
Natural Language Processing andKnowledge Engineering (NLP-KE), 2011 7th International Conference on
Conference_Location :
Tokushima
Print_ISBN :
978-1-61284-729-0
DOI :
10.1109/NLPKE.2011.6138211