Extracting characteristic words of text using neural networks

Author

Saito, Kazumi ; Nakano, Ryohei

Author_Institution

NTT Commun. Sci. Lab., Kyoto, Japan

Volume

fYear

2004

fDate

25-29 July 2004

Firstpage

1397

Abstract

In this paper, we consider models for estimating categories of documents and extracting characteristic words of such categories. To this end, we focus on three models, i.e., naive Bayes and two types of neural networks formalized as statistical models. Here, suitable categories of documents are estimated based on posterior probabilities, and characteristic words are extracted based on the magnitude of resulting parameter values. In our experiments using a set of real Web pages, we compare these models in the aspect of categorization performances and extraction capabilities of characteristic words.

Keywords

Bayes methods; neural nets; probability; statistical analysis; word processing; characteristic words extraction; naive Bayes; neural networks; posterior probabilities; real Web pages; statistical models; Electronic mail; Frequency; Laboratories; Machine learning algorithms; Neural networks; Probability; Support vector machine classification; Support vector machines; Text mining; Web pages;

fLanguage

English

Publisher

ieee

Conference_Titel

Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on

ISSN

1098-7576

Print_ISBN

0-7803-8359-1

Type

conf

DOI

10.1109/IJCNN.2004.1380154

Filename

1380154

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=423709