DocumentCode
445864
Title
Weight sharing on naive Bayes document model
Author
Saito, Kazumi ; Nakano, Ryohei
Author_Institution
NTT Commun. Sci. Labs., Kyoto, Japan
Volume
1
fYear
2005
fDate
31 July-4 Aug. 2005
Firstpage
576
Abstract
In this paper, we study weight sharing on the naive Bayes document model. Firstly we consider splitting words into a relatively small number of groups such that words in each group have the same parameter value. This problem can be regarded as a probabilistic parameter sharing task. In this task, we formalize the problem in terms of maximum likelihood estimation, and then propose an algorithm for this purpose. Secondly we focus on an adaptive hyperparameter estimation problem based on prior distributions constructed by using such word groups. This problem can be regarded as a hyperparameter sharing task. In this task, we describe a framework and algorithm, which enables to derive the unique optimal solution in the context of leave-one-out cross validation. In our experiments using a benchmark document set called webkb, we show a series of simulation results using the proposed algorithms.
Keywords
Bayes methods; data mining; document handling; maximum likelihood estimation; hyperparameter estimation; hyperparameter sharing; leave-one-out cross validation; maximum likelihood estimation; naive Bayes document model; probabilistic parameter sharing; webkb; Electronic mail; Frequency conversion; Laboratories; Maximum likelihood estimation; Neural networks; Polynomials; Robustness; Text mining; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2005. IJCNN '05. Proceedings. 2005 IEEE International Joint Conference on
Print_ISBN
0-7803-9048-2
Type
conf
DOI
10.1109/IJCNN.2005.1555895
Filename
1555895
Link To Document