Title :
Incremental clustering for dynamic document databases
Author :
Can, Fazli ; Drochak, Nicklas D., II
Author_Institution :
Miami Univ., FL, USA
Abstract :
Clustering of very large document databases is a necessity to reduce the space time complexity of retrieval operations. Continuous update of clusters is required owing to the dynamic nature of document databases. An algorithm for incremental clustering is introduced. Its complexity analysis and cost comparison with reclustering are provided. It is shown through empirical testing that the incremental clustering is as effective as the reclustering by producing clusters that are very similar. This similarity is then shown not to be by chance. In the experiments it is shown that the information retrieval effectiveness of the algorithm is compatible with the reclustering algorithm, which is known to have good retrieval performance
Keywords :
computational complexity; database management systems; information retrieval systems; complexity analysis; cost comparison; dynamic document databases; empirical testing; incremental clustering; information retrieval effectiveness; reclustering; retrieval operations; retrieval performance; space time complexity; very large document databases; Clustering algorithms; Costs; Databases; Information retrieval; Machine assisted indexing; Testing;
Conference_Titel :
Applied Computing, 1990., Proceedings of the 1990 Symposium on
Conference_Location :
Fayetteville, AR
Print_ISBN :
0-8186-2031-5
DOI :
10.1109/SOAC.1990.82141