Title :
Text Classification Based on a Novel Bayesian Hierarchical Model
Author :
Zhou, Shibin ; Li, Kan ; Liu, Yushu
Author_Institution :
Sch. of Comput. Sci. & Technol., Beijing Inst. of Technol., Beijing
Abstract :
In the text literature, many Bayesian generative models were proposed to represent documents and words in order to process text effectively and accurately. As the most popular one of these models, Latent Dirichlet Allocation Model(LDA) did great job in dimensionality reduction for document classification. In this paper, inspiring by latent Dirichlet allocation model, we propose LDCM or latent Dirichlet category model for text classification rather than dimensionality reduction. LDCM estimate parameters of models by variational inference and use variational parameters to estimate maximum a posteriori of terms. As demonstrated by our experimental results, we report satisfactory categorization performances about our method on various real-world text documents.
Keywords :
Bayes methods; data reduction; inference mechanisms; maximum likelihood estimation; pattern classification; text analysis; Bayesian hierarchical generative model; dimensionality reduction; latent Dirichlet allocation model; latent Dirichlet category model; maximum a posteriori estimation; parameter estimation; text classification; text document representation; text processing; variational inference approach; Bayesian methods; Computational efficiency; Computer science; Fuzzy systems; Indexing; Information retrieval; Large scale integration; Linear discriminant analysis; Parameter estimation; Text categorization;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2008. FSKD '08. Fifth International Conference on
Conference_Location :
Shandong
Print_ISBN :
978-0-7695-3305-6
DOI :
10.1109/FSKD.2008.666