DocumentCode
423514
Title
A time-based self-organising model for document clustering
Author
Hung, Chihli ; Wermter, Stefan
Author_Institution
De Lin Inst. of Technol., Taiwan
Volume
1
fYear
2004
fDate
25-29 July 2004
Lastpage
22
Abstract
Most current approaches for document clustering do not consider the non-stationary feature of real world document collection. In this paper, in a non-stationary environment, we propose a new self-organising model, namely the dynamic adaptive self-organising hybrid (DASH) model. The DASH model runs continuously since the new document set is formed consecutively for training while the old document set is still at the training stage. Knowledge learned from the old data set is adjusted to reflect the new data set and therefore document clusters are up-to-date. We test the performance of our model using the Reuters-RCV1 news corpus and obtain promising results based on the criteria of classification accuracy and average quantization error.
Keywords
pattern clustering; self-organising feature maps; average quantization error; classification accuracy criteria; document clustering; dynamic adaptive self-organising hybrid model; time-based self-organising model; Artificial neural networks; Hybrid intelligent systems; Knowledge transfer; Learning systems; Prototypes; Quantization; Space technology; Stress; Technological innovation; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
ISSN
1098-7576
Print_ISBN
0-7803-8359-1
Type
conf
DOI
10.1109/IJCNN.2004.1379861
Filename
1379861
Link To Document