• DocumentCode
    423709
  • Title

    Extracting characteristic words of text using neural networks

  • Author

    Saito, Kazumi ; Nakano, Ryohei

  • Author_Institution
    NTT Commun. Sci. Lab., Kyoto, Japan
  • Volume
    2
  • fYear
    2004
  • fDate
    25-29 July 2004
  • Firstpage
    1397
  • Abstract
    In this paper, we consider models for estimating categories of documents and extracting characteristic words of such categories. To this end, we focus on three models, i.e., naive Bayes and two types of neural networks formalized as statistical models. Here, suitable categories of documents are estimated based on posterior probabilities, and characteristic words are extracted based on the magnitude of resulting parameter values. In our experiments using a set of real Web pages, we compare these models in the aspect of categorization performances and extraction capabilities of characteristic words.
  • Keywords
    Bayes methods; neural nets; probability; statistical analysis; word processing; characteristic words extraction; naive Bayes; neural networks; posterior probabilities; real Web pages; statistical models; Electronic mail; Frequency; Laboratories; Machine learning algorithms; Neural networks; Probability; Support vector machine classification; Support vector machines; Text mining; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
  • ISSN
    1098-7576
  • Print_ISBN
    0-7803-8359-1
  • Type

    conf

  • DOI
    10.1109/IJCNN.2004.1380154
  • Filename
    1380154