Title :
HIREL: An Incremental Clustering Algorithm for Relational Datasets
Author :
Li, Tao ; Anand, Sarabjot S.
Author_Institution :
Dept. of Comput. Sci., Univ. of Warwick Coventry, Coventry
Abstract :
Traditional clustering approaches usually analyze static datasets in which objects are kept unchanged after being processed, but many practical datasets are dynamically modified which means some previously learned patterns have to be updated accordingly. Re-clustering the whole dataset from scratch is not a good choice due to the frequent data modifications and the limited out-of-service time, so the development of incremental clustering approaches is highly desirable. Besides that, propositional clustering algorithms are not suitable for relational datasets because of their quadratic computational complexity. In this paper, we propose an incremental clustering algorithm that requires only one pass of the relational dataset. The utilization of the Representative Objects and the balanced Search Tree greatly accelerate the learning procedure. Experimental results prove the effectiveness of our algorithm.
Keywords :
learning (artificial intelligence); pattern clustering; relational databases; tree searching; HIREL; balanced search tree; incremental clustering algorithm; learning procedure; relational dataset; representative object utilization; Algorithm design and analysis; Clustering algorithms; Computational complexity; Computer science; Data analysis; Data mining; Data warehouses; Pattern analysis; Publishing; Relational databases; Clustering; Incremental; Relational;
Conference_Titel :
Data Mining, 2008. ICDM '08. Eighth IEEE International Conference on
Conference_Location :
Pisa
Print_ISBN :
978-0-7695-3502-9
DOI :
10.1109/ICDM.2008.116