DocumentCode :
1865988
Title :
Nonnegative Matrix Factorization and its application to pattern analysis and text mining
Author :
Zurada, Jacek M. ; Ensari, Tolga ; Asl, Ehsan Hosseini ; Chorowski, Jan
Author_Institution :
Electr. & Comput. Eng., Univ. of Louisville, Louisville, KY, USA
fYear :
2013
fDate :
8-11 Sept. 2013
Firstpage :
11
Lastpage :
16
Abstract :
Nonnegative Matrix Factorization (NMF) is one of the most promising techniques for reducing the dimensionality of data. This presentation compares the method with other popular matrix decomposition approaches for various pattern analysis tasks. NMF has also been widely applied to clustering and latent feature extraction, among other tasks. Several types of objective functions have been used for NMF in the literature. Instead of minimizing the common Euclidean Distance (EucD) error, we review an alternative method that maximizes the correntropy similarity measure to produce the factorization. Correntropy is an entropy-based criterion defined as a nonlinear similarity measure. Following the discussion of the maximization of the correntropy function, we use it to cluster a document data set and compare the clustering performance with that of EucD-based NMF. Our approach was applied to, and is illustrated by, the clustering of documents in the 20-Newsgroups data set. The results show that our approach produces, on average, better clustering than other methods that use EucD as an objective function.
Keywords :
data mining; data reduction; entropy; feature extraction; matrix decomposition; pattern clustering; text analysis; 20-Newsgroups data set; EucD-based NMF; cluster document data set; correntropy similarity measure maximization; data dimensionality reduction; document clustering; entropy-based criterion; latent feature extraction; nonlinear similarity measure; nonnegative matrix factorization; objective function; pattern analysis; text mining; Entropy; Face; Face recognition; Linear programming; Matrix decomposition; Pattern analysis; Principal component analysis; Correntropy; Face recognition; Nonnegative Matrix Factorization; Principal Component Analysis;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computer Science and Information Systems (FedCSIS), 2013 Federated Conference on
Conference_Location :
Kraków
Type :
conf
Filename :
6643969