Title of article :
A Probabilistic Topic Model based on an Arbitrary-Length Co-occurrence Window
Author/Authors :
زاهدي مرتضي 1321 نويسنده فني و مهندسي zahedi morteza , رحيمي مرضيه نويسنده مرکز تحقيقات گياهان دارويي- دانشگاه علوم پزشکي شهرکرد، شهرکرد، ايران Rahimi M , مشايخي هدي نويسنده
Pages :
7
From page :
19
Abstract :
Probabilistic topic models have been very popular in automatic text analysis since their introduction. These models work based on word co-occurrence, but are not very flexible with respect to the context in which co-occurrence is considered. Many probabilistic topic models do not allow for taking local or spatial data into account. In this paper, we introduce a probabilistic topic model that benefits from an arbitrary-length co-occurrence window and encodes local word dependencies for extracting topics. We assume a multinomial distribution with Dirichlet prior over the window positions to let the words in every position have a chance to influence topic assignments. In the proposed model, topics being shown by word pairs have a more meaningful presentation. The model is applied on a dataset of 2000 documents. The proposed model produces interesting meaningful topics and reduces the problem of sparseness.
Journal title :
Astroparticle Physics
Serial Year :
2017
Record number :
2409983
Link To Document :
بازگشت