DocumentCode :
423709
Title :
Extracting characteristic words of text using neural networks
Author :
Saito, Kazumi ; Nakano, Ryohei
Author_Institution :
NTT Commun. Sci. Lab., Kyoto, Japan
Volume :
2
fYear :
2004
fDate :
25-29 July 2004
Firstpage :
1397
Abstract :
In this paper, we consider models for estimating categories of documents and extracting characteristic words of such categories. To this end, we focus on three models, i.e., naive Bayes and two types of neural networks formalized as statistical models. Here, suitable categories of documents are estimated based on posterior probabilities, and characteristic words are extracted based on the magnitude of resulting parameter values. In our experiments using a set of real Web pages, we compare these models in the aspect of categorization performances and extraction capabilities of characteristic words.
Keywords :
Bayes methods; neural nets; probability; statistical analysis; word processing; characteristic words extraction; naive Bayes; neural networks; posterior probabilities; real Web pages; statistical models; Electronic mail; Frequency; Laboratories; Machine learning algorithms; Neural networks; Probability; Support vector machine classification; Support vector machines; Text mining; Web pages;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
ISSN :
1098-7576
Print_ISBN :
0-7803-8359-1
Type :
conf
DOI :
10.1109/IJCNN.2004.1380154
Filename :
1380154
Link To Document :
بازگشت