DocumentCode :
2369805
Title :
Cluster validation: An integrative method for cluster analysis
Author :
Visvanathan, Mahesh ; Adagarla, B.S. ; Gerald, H.L. ; Smith, Peter
Author_Institution :
Bioinf. Core Facility, Univ. of Kansas, Lawrence, KS, USA
fYear :
2009
fDate :
1-4 Nov. 2009
Firstpage :
238
Lastpage :
242
Abstract :
Clustering is a widely used to discover underlying patterns and groups in data and there is a need to validate the quality of clusters generated by the numerous clustering algorithms in use. The need for cluster validitation arises from the fundamental definition of unsupervised learning. As clustering is an unsupervised learning process, the prediction of correct number of clusters is a hurdle which can be cleared by using cluster validity indices to assess the quality of the clusters. We have developed a tool for cluster validation as a part of GOAPhAR, a web based tool that integrates from disparate sources, information regarding gene annotations, protein annotations, identifiers associated with probe sets, functional pathways, protein interactions, gene Ontology and publicly available microarray datasets. Our cluster validity tool calculates three indices to indicate clustering quality viz. the Silhouette, Dunn´s and Davies-Bouldin indices and outputs them to the user. The values of these indices can be used to judge the quality of clustering and to optimize the process of selecting an appropriate clustering algorithm and number of clusters.
Keywords :
bioinformatics; ontologies (artificial intelligence); pattern clustering; proteins; unsupervised learning; cluster analysis; cluster validation; cluster validity tool; clustering algorithms; clustering quality; functional pathways; gene annotations; gene ontology; protein annotations; protein interactions; unsupervised learning process; Algorithm design and analysis; Bioinformatics; Clustering algorithms; Data mining; Gene expression; Genomics; Partitioning algorithms; Pattern analysis; Protein engineering; Unsupervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine Workshop, 2009. BIBMW 2009. IEEE International Conference on
Conference_Location :
Washington, DC
Print_ISBN :
978-1-4244-5121-0
Type :
conf
DOI :
10.1109/BIBMW.2009.5332101
Filename :
5332101
Link To Document :
بازگشت