DocumentCode
2336252
Title
A new method of text categorization based on PA and Kohonen network
Author
Xu, Jian-Suo ; Wang, Zheng-Ou
Author_Institution
Inst. of Syst. Eng., Tianjin Univ., China
Volume
3
fYear
2004
fDate
26-29 Aug. 2004
Firstpage
1324
Abstract
This paper presents a new method of text categorization by using the theory of pattern aggregation (PA) and Kohonen network. The Kohonen network is applied to realizing text categorization, which has a defect of too slowly speed of training, and so we apply supervising method to training network. Therefore, the speed and the precision of classifying are improved. However, to text vector of high dimension, the speed of classifying is still very slow using Kohonen network. Even the result of text categorization cannot be acquired. The new method establishes vector space model of term weight by the theory of PA, which enhances the function of the words from the viewpoint of categorization effect, and decreases the dimension of vector through eliminating redundant features. Therefore the new method advances largely the speed and the precision of text categorization.
Keywords
learning (artificial intelligence); pattern classification; self-organising feature maps; text analysis; Kohonen network; pattern aggregation theory; pattern classification; supervising method; text categorization; training network; training speed; vector dimension reduction; vector space model; Cybernetics; Entropy; Feature extraction; Frequency; Learning systems; Machine learning; Mutual information; Pattern recognition; Text categorization; Tree data structures;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
Print_ISBN
0-7803-8403-2
Type
conf
DOI
10.1109/ICMLC.2004.1381978
Filename
1381978
Link To Document