• DocumentCode
    2872070
  • Title

    A Hybrid SOM-Based Document Organization System

  • Author

    Corrêa, Renato Fernandes ; Ludermir, Teresa Bernarda

  • Author_Institution
    Pernambuco University, Brazil; Federal University of Pernambuco, Brazil
  • fYear
    2006
  • fDate
    23-27 Oct. 2006
  • Firstpage
    90
  • Lastpage
    95
  • Abstract
    This paper presents and evaluates a hybrid system to self-organization of massive document collections based on Self-Organizing Maps. The hybrid system uses prototypes generated by a clustering algorithm to training the document maps, thus reducing the training time of large maps. We test the system with two clustering algorithms: k-means and the AY method. The experiments were carried out with the Reuters- 21758 v1.0 collection. The performance of the system was measured in terms of text categorization effectiveness on test set and training time. The experimental results show that proposed system generate pretty good document maps and that the system had similar effectiveness performance with both clustering methods, however the use of k-means generated the smallest training time.
  • Keywords
    Clustering algorithms; Hybrid power systems; Indexing; Informatics; Prototypes; Self organizing feature maps; Stationary state; System testing; Text categorization; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2006. SBRN '06. Ninth Brazilian Symposium on
  • Conference_Location
    Ribeirao Preto, Brazil
  • Print_ISBN
    0-7695-2680-2
  • Type

    conf

  • DOI
    10.1109/SBRN.2006.3
  • Filename
    4026816