Title :
Regularized nonnegative matrix factorization for clustering gene expression data
Author :
Weixiang Liu ; Tianfu Wang ; Siping Chen
Author_Institution :
Dept. of Biomedicai Eng., Shenzhen Univ., Shenzhen, China
Abstract :
Recently nonnegative Matrix Factorization (NMF) has been proven a powerful method in clustering analysis of gene expression data. There exist two popular loss functions for minimization in decomposition: one is Euclidean distance and the other generalized Kullback-Leibler divergence. Both loss functions can be derived from a linear model with additive noise, and the Euclidean distance loss corresponds to Gaussian noise while the generalized Kullback-Leibler divergence corresponds to Poisson noise. However real data is not only Gaussian or Poisson, or not both. In order to take into account complex type of noise, we combine both loss functions for NMF according to regularization method. We compared NMF based on Euclidean distance, the generalized Kullback-Leibler divergence, and our regularized version, with application in clustering gene expression data. The experimental results demonstrate the effectiveness of the proposed method.
Keywords :
Gaussian noise; Poisson distribution; biology computing; genetics; matrix decomposition; minimisation; statistical analysis; Euclidean distance; Gaussian noise; NMF; Poisson noise; additive noise; clustering analysis; decomposition; gene expression data clustering; generalized Kullback-Leibler divergence; linear model; loss functions; minimization; regularized nonnegative matrix factorization; regularized version; Accuracy; Algorithm design and analysis; Bioinformatics; Data analysis; Euclidean distance; Gene expression; Noise;
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2013 IEEE International Conference on
Conference_Location :
Shanghai
DOI :
10.1109/BIBM.2013.6732613