• DocumentCode
    2336252
  • Title

    A new method of text categorization based on PA and Kohonen network

  • Author

    Xu, Jian-Suo ; Wang, Zheng-Ou

  • Author_Institution
    Inst. of Syst. Eng., Tianjin Univ., China
  • Volume
    3
  • fYear
    2004
  • fDate
    26-29 Aug. 2004
  • Firstpage
    1324
  • Abstract
    This paper presents a new method of text categorization by using the theory of pattern aggregation (PA) and Kohonen network. The Kohonen network is applied to realizing text categorization, which has a defect of too slowly speed of training, and so we apply supervising method to training network. Therefore, the speed and the precision of classifying are improved. However, to text vector of high dimension, the speed of classifying is still very slow using Kohonen network. Even the result of text categorization cannot be acquired. The new method establishes vector space model of term weight by the theory of PA, which enhances the function of the words from the viewpoint of categorization effect, and decreases the dimension of vector through eliminating redundant features. Therefore the new method advances largely the speed and the precision of text categorization.
  • Keywords
    learning (artificial intelligence); pattern classification; self-organising feature maps; text analysis; Kohonen network; pattern aggregation theory; pattern classification; supervising method; text categorization; training network; training speed; vector dimension reduction; vector space model; Cybernetics; Entropy; Feature extraction; Frequency; Learning systems; Machine learning; Mutual information; Pattern recognition; Text categorization; Tree data structures;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
  • Print_ISBN
    0-7803-8403-2
  • Type

    conf

  • DOI
    10.1109/ICMLC.2004.1381978
  • Filename
    1381978