Title :
Clustering: Algorithms and Applications
Author_Institution :
CECS Dept., Univ. of Louisville, Louisville, KY
Abstract :
In this paper, we describe algorithms that perform fuzzy clustering and feature weighting simultaneously and in an unsupervised manner. These algorithms are conceptually and computationally simple, and learn a different set of feature weights for each identified cluster. The cluster dependent feature weights offer two advantages. First, they guide the clustering process to partition the data into more meaningful clusters. Second, they can be used in the subsequent steps of a learning system to improve its learning behavior. An extension of the algorithm to deal with an unknown number of clusters is also presented. The extension is based on competitive agglomeration, whereby the number of clusters is over-specified, and adjacent clusters are allowed to compete for data points in a manner that causes clusters which lose in the competition to gradually become depleted and vanish. We illustrate the performance of the proposed approach by using it to segment color images, categorize text document collections, and build a multi-modal thesaurus and use it to annotate image regions.
Keywords :
fuzzy set theory; image colour analysis; image segmentation; pattern clustering; annotate image regions; categorize text document collections; cluster dependent feature weights; color image segmentation; competitive agglomeration; data partition; feature weighting; fuzzy clustering; learning system; multimodal thesaurus; Clustering algorithms; Color; Data mining; Image processing; Image segmentation; Information retrieval; Learning systems; Partitioning algorithms; Shape measurement; Thesauri; Feature weighting; competitive agglomeration; content-based image retrieval; fuzzy clustering; image annotation; image segmentation; multimedia data mining;
Conference_Titel :
Image Processing Theory, Tools and Applications, 2008. IPTA 2008. First Workshops on
Conference_Location :
Sousse
Print_ISBN :
978-1-4244-3321-6
Electronic_ISBN :
978-1-4244-3322-3
DOI :
10.1109/IPTA.2008.4743793