DocumentCode
3782992
Title
Automatic category generation for text documents by self-organizing maps
Author
Hsin-Chang Yang; Chung-Hong Lee
Author_Institution
Dept. of Inf. Manage., Chang Jung Univ., Tainan, Taiwan
Volume
3
fYear
2000
Firstpage
581
Abstract
One important task for text data mining is automatic text categorization, which assigns a text document to some predefined category according to their correlations. Traditionally, these categories as well as the correlations among them are determined bp human experts. In this paper, we devised a novel approach to automatically generate categories. The self-organizing map model is used to generate two maps, namely the word cluster map and the document cluster map, in which a neuron represents a cluster of words and documents respectively. Our approach is to analyze the document cluster map to find centroids of some super-clusters. We also devised a method to select the category term from the word cluster map. The hierarchical structure of categories may be generated by recursively applying the same method. Text categorization is the natural consequence of such automatic category generation process.
Keywords
"Self organizing feature maps","Text categorization","Neurons","Data mining","Humans","Information management","Natural language processing","Indexing","Ergonomics","Clustering algorithms"
Publisher
ieee
Conference_Titel
Neural Networks, 2000. IJCNN 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on
ISSN
1098-7576
Print_ISBN
0-7695-0619-4
Type
conf
DOI
10.1109/IJCNN.2000.861377
Filename
861377
Link To Document