DocumentCode
2872070
Title
A Hybrid SOM-Based Document Organization System
Author
Corrêa, Renato Fernandes ; Ludermir, Teresa Bernarda
Author_Institution
Pernambuco University, Brazil; Federal University of Pernambuco, Brazil
fYear
2006
fDate
23-27 Oct. 2006
Firstpage
90
Lastpage
95
Abstract
This paper presents and evaluates a hybrid system to self-organization of massive document collections based on Self-Organizing Maps. The hybrid system uses prototypes generated by a clustering algorithm to training the document maps, thus reducing the training time of large maps. We test the system with two clustering algorithms: k-means and the AY method. The experiments were carried out with the Reuters- 21758 v1.0 collection. The performance of the system was measured in terms of text categorization effectiveness on test set and training time. The experimental results show that proposed system generate pretty good document maps and that the system had similar effectiveness performance with both clustering methods, however the use of k-means generated the smallest training time.
Keywords
Clustering algorithms; Hybrid power systems; Indexing; Informatics; Prototypes; Self organizing feature maps; Stationary state; System testing; Text categorization; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2006. SBRN '06. Ninth Brazilian Symposium on
Conference_Location
Ribeirao Preto, Brazil
Print_ISBN
0-7695-2680-2
Type
conf
DOI
10.1109/SBRN.2006.3
Filename
4026816
Link To Document