Title :
A Hybrid SOM-Based Document Organization System
Author :
Corrêa, Renato Fernandes ; Ludermir, Teresa Bernarda
Author_Institution :
Pernambuco University, Brazil; Federal University of Pernambuco, Brazil
Abstract :
This paper presents and evaluates a hybrid system to self-organization of massive document collections based on Self-Organizing Maps. The hybrid system uses prototypes generated by a clustering algorithm to training the document maps, thus reducing the training time of large maps. We test the system with two clustering algorithms: k-means and the AY method. The experiments were carried out with the Reuters- 21758 v1.0 collection. The performance of the system was measured in terms of text categorization effectiveness on test set and training time. The experimental results show that proposed system generate pretty good document maps and that the system had similar effectiveness performance with both clustering methods, however the use of k-means generated the smallest training time.
Keywords :
Clustering algorithms; Hybrid power systems; Indexing; Informatics; Prototypes; Self organizing feature maps; Stationary state; System testing; Text categorization; Vectors;
Conference_Titel :
Neural Networks, 2006. SBRN '06. Ninth Brazilian Symposium on
Conference_Location :
Ribeirao Preto, Brazil
Print_ISBN :
0-7695-2680-2
DOI :
10.1109/SBRN.2006.3