• DocumentCode
    423514
  • Title

    A time-based self-organising model for document clustering

  • Author

    Hung, Chihli ; Wermter, Stefan

  • Author_Institution
    De Lin Inst. of Technol., Taiwan
  • Volume
    1
  • fYear
    2004
  • fDate
    25-29 July 2004
  • Lastpage
    22
  • Abstract
    Most current approaches for document clustering do not consider the non-stationary feature of real world document collection. In this paper, in a non-stationary environment, we propose a new self-organising model, namely the dynamic adaptive self-organising hybrid (DASH) model. The DASH model runs continuously since the new document set is formed consecutively for training while the old document set is still at the training stage. Knowledge learned from the old data set is adjusted to reflect the new data set and therefore document clusters are up-to-date. We test the performance of our model using the Reuters-RCV1 news corpus and obtain promising results based on the criteria of classification accuracy and average quantization error.
  • Keywords
    pattern clustering; self-organising feature maps; average quantization error; classification accuracy criteria; document clustering; dynamic adaptive self-organising hybrid model; time-based self-organising model; Artificial neural networks; Hybrid intelligent systems; Knowledge transfer; Learning systems; Prototypes; Quantization; Space technology; Stress; Technological innovation; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
  • ISSN
    1098-7576
  • Print_ISBN
    0-7803-8359-1
  • Type

    conf

  • DOI
    10.1109/IJCNN.2004.1379861
  • Filename
    1379861