DocumentCode :
3782992
Title :
Automatic category generation for text documents by self-organizing maps
Author :
Hsin-Chang Yang; Chung-Hong Lee
Author_Institution :
Dept. of Inf. Manage., Chang Jung Univ., Tainan, Taiwan
Volume :
3
fYear :
2000
Firstpage :
581
Abstract :
One important task for text data mining is automatic text categorization, which assigns a text document to some predefined category according to their correlations. Traditionally, these categories as well as the correlations among them are determined bp human experts. In this paper, we devised a novel approach to automatically generate categories. The self-organizing map model is used to generate two maps, namely the word cluster map and the document cluster map, in which a neuron represents a cluster of words and documents respectively. Our approach is to analyze the document cluster map to find centroids of some super-clusters. We also devised a method to select the category term from the word cluster map. The hierarchical structure of categories may be generated by recursively applying the same method. Text categorization is the natural consequence of such automatic category generation process.
Keywords :
"Self organizing feature maps","Text categorization","Neurons","Data mining","Humans","Information management","Natural language processing","Indexing","Ergonomics","Clustering algorithms"
Publisher :
ieee
Conference_Titel :
Neural Networks, 2000. IJCNN 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on
ISSN :
1098-7576
Print_ISBN :
0-7695-0619-4
Type :
conf
DOI :
10.1109/IJCNN.2000.861377
Filename :
861377
Link To Document :
بازگشت