مرکز منطقه ای اطلاع رساني علوم و فناوري - Clustering with Extended Constraints by Co-Training

DocumentCode :

2112255

Title :

Clustering with Extended Constraints by Co-Training

Author :

Okabe, Masayuki ; Yamada, Shigeru

Author_Institution :

Inf. & Media Center, Toyohashi Univ. of Technol., Toyohashi, Japan

Volume :

fYear :

2012

fDate :

4-7 Dec. 2012

Firstpage :

Lastpage :

Abstract :

Constrained Clustering is a data mining technique that produces clusters of similar data by using pre-given constraints about data pairs. If we consider using constrained clustering for some practical interactive systems such as information retrieval or recommendation systems, the cost of constraint preparation will be the problem as well as other machine learning techniques. In this paper, we propose a method to complement the lack of constraints by using co-training framework, which extends training examples by leveraging two kinds of feature sets. Our method is based on a constrained clustering ensemble algorithm that integrates a set of clusters produced by a constrained k-means with random ordered data assignment, and runs the same algorithm on two different feature sets to extend constraints. We evaluate our method on a Web page dataset that provides two different feature sets. The results show that our method achieves the performance improvement by using co-training approach.

Keywords :

data mining; learning (artificial intelligence); pattern clustering; Web page dataset; clustering ensemble algorithm; constrained clustering; constrained k-means algorithm; constraint preparation; cotraining framework; data cluster; data mining technique; information retrieval; machine learning technique; random ordered data assignment; recommendation system; Cluster ensemble; Co-training; Constrained clustering;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Web Intelligence and Intelligent Agent Technology (WI-IAT), 2012 IEEE/WIC/ACM International Conferences on

Conference_Location :

Macau

Print_ISBN :

978-1-4673-6057-9

Type :

conf

DOI :

10.1109/WI-IAT.2012.113

Filename :

6511653

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2112255