DocumentCode :
2872070
Title :
A Hybrid SOM-Based Document Organization System
Author :
Corrêa, Renato Fernandes ; Ludermir, Teresa Bernarda
Author_Institution :
Pernambuco University, Brazil; Federal University of Pernambuco, Brazil
fYear :
2006
fDate :
23-27 Oct. 2006
Firstpage :
90
Lastpage :
95
Abstract :
This paper presents and evaluates a hybrid system for the self-organization of massive document collections based on Self-Organizing Maps. The hybrid system uses prototypes generated by a clustering algorithm to train the document maps, thus reducing the training time of large maps. We test the system with two clustering algorithms: k-means and the AY method. The experiments were carried out with the Reuters-21578 v1.0 collection. The performance of the system was measured in terms of text categorization effectiveness on the test set and training time. The experimental results show that the proposed system generates good document maps and that its effectiveness was similar with both clustering methods; however, k-means yielded the shortest training time.
Keywords :
Clustering algorithms; Hybrid power systems; Indexing; Informatics; Prototypes; Self organizing feature maps; Stationary state; System testing; Text categorization; Vectors
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Neural Networks, 2006. SBRN '06. Ninth Brazilian Symposium on
Conference_Location :
Ribeirao Preto, Brazil
Print_ISBN :
0-7695-2680-2
Type :
conf
DOI :
10.1109/SBRN.2006.3
Filename :
4026816