DocumentCode :
1484882
Title :
Clustering with Multiviewpoint-Based Similarity Measure
Author :
Nguyen, Duc Thang ; Chen, Lihui ; Chan, Chee Keong
Author_Institution :
Div. of Inf. Eng., Nanyang Technol. Univ., Singapore, Singapore
Volume :
24
Issue :
6
fYear :
2012
fDate :
6/1/2012 12:00:00 AM
Firstpage :
988
Lastpage :
1001
Abstract :
All clustering methods have to assume some cluster relationship among the data objects that they are applied on. Similarity between a pair of objects can be defined either explicitly or implicitly. In this paper, we introduce a novel multiviewpoint-based similarity measure and two related clustering methods. The major difference between a traditional dissimilarity/similarity measure and ours is that the former uses only a single viewpoint, which is the origin, while the latter utilizes many different viewpoints, which are objects assumed to not be in the same cluster with the two objects being measured. Using multiple viewpoints, more informative assessment of similarity could be achieved. Theoretical analysis and empirical study are conducted to support this claim. Two criterion functions for document clustering are proposed based on this new measure. We compare them with several well-known clustering algorithms that use other popular similarity measures on various document collections to verify the advantages of our proposal.
Keywords :
document handling; pattern clustering; clustering algorithm; data objects; dissimilarity measure; document clustering; informative assessment; multiviewpoint-based similarity measure; Algorithm design and analysis; Clustering algorithms; Current measurement; Euclidean distance; Partitioning algorithms; Proposals; Strontium; Document clustering; similarity measure.; text mining;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2011.86
Filename :
5740893
Link To Document :
بازگشت