DocumentCode :
3630476
Title :
Structural poisson mixtures for classification of documents
Author :
Jiri Grim;Jana Novovicova;Petr Somol
Author_Institution :
Institute of Information Theory and Automation, P.O.BOX 18, 18208 Prague 8, Czech Republic
fYear :
2008
Firstpage :
1
Lastpage :
4
Abstract :
Considering the statistical text classification problem we approximate class-conditional probability distributions by structurally modified Poisson mixtures. By introducing the structural model we can use different subsets of input variables to evaluate conditional probabilities of different classes in the Bayes formula. The method is applicable to document vectors of arbitrary dimension without any preprocessing. The structural optimization can be included into the EM algorithm in a statistically correct way.
Keywords :
"Vocabulary","Text categorization","Frequency","Probability distribution","Machine learning","Bayesian methods","Information theory","Automation","Input variables","Machine learning algorithms"
Publisher :
ieee
Conference_Titel :
Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
ISSN :
1051-4651
Print_ISBN :
978-1-4244-2174-9
Type :
conf
DOI :
10.1109/ICPR.2008.4761669
Filename :
4761669
Link To Document :
بازگشت