DocumentCode :
457395
Title :
Semi-Parametric Model-Based Clustering for DNA Microarray Data
Author :
Han, Bohyung ; Davis, Larry S.
Author_Institution :
Digital Media Solutions Lab., Samsung Inf. Syst. America, Irvine, CA
Volume :
3
fYear :
0
fDate :
0-0 0
Firstpage :
324
Lastpage :
327
Abstract :
Various clustering methods have been proposed for the analysis of gene expression data, but conventional clustering algorithms have several critical limitations; how to set parameters such as number of clusters, initial cluster centers, and so on. In this paper, we propose a semi-parametric model-based clustering algorithm in which the underlying model is a mixture of Gaussian. Each gene expression data builds a Gaussian kernel, and the uncertainty of microarray data is naturally integrated in the data representation. Our algorithm provides a principled method to automatically determine parameters - number of components in the mixture, mean, covariance, and weight of each Gaussian - by mean-shift procedure (Comaniciu and Meer, 1999) and curvature fitting. After the initialization, expectation maximization (EM) algorithm is employed for clustering to achieve maximum likelihood (ML). The performance of our algorithm is compared with standard EM algorithm using real data as well as synthetic data
Keywords :
DNA; Gaussian processes; biology computing; genetics; pattern clustering; DNA microarray data; Gaussian kernel; Gaussian mixtures; curvature fitting; data representation; expectation maximization; gene expression data; maximum likelihood; mean-shift procedure; semiparametric model-based clustering; Algorithm design and analysis; Clustering algorithms; Clustering methods; DNA; Gene expression; Information analysis; Kernel; Parameter estimation; Pattern recognition; Uncertainty;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition, 2006. ICPR 2006. 18th International Conference on
Conference_Location :
Hong Kong
ISSN :
1051-4651
Print_ISBN :
0-7695-2521-0
Type :
conf
DOI :
10.1109/ICPR.2006.1044
Filename :
1699531
Link To Document :
بازگشت