Title :
New Methods of Data Clustering and Classification Based on NMF
Author :
Tang, Jie ; Xinyu Ceng ; Peng, Bo
Author_Institution :
Southwest Pet. Univ., Chengdu, China
Abstract :
Nonnegative matrix factorization method is a kind of new matrix decomposition method. It is an effective tool for large data processing and analysis. At the same time, NMF has an important performance on intelligent information processing and pattern recognition. This paper first analyses and discusses the NMF algorithms based on its basic theory. We then propose new methods of data clustering and classification based on NMF separately. NMF method is applied to reduce the dimension of the original matrix. We run clustering algorithms on the encoded matrix after NMF processing instead of on the original matrix. Running clustering algorithms on smaller encoded matrix can save more time and storage space. After that, we bring in a series of improvement methods of classification on the basis of clustering. Finally we have done experiments to test and verify them, and gotten good results.
Keywords :
data analysis; data mining; matrix decomposition; pattern classification; pattern clustering; NMF; data analysis; data classification method; data clustering method; data mining; data processing; intelligent information processing; nonnegative matrix factorization method; pattern recognition; Accuracy; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Matrix decomposition; Support vector machines; Training; Data classification; Data clustering; K-means; NMF; encoded matrix;
Conference_Titel :
Business Computing and Global Informatization (BCGIN), 2011 International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4577-0788-9
Electronic_ISBN :
978-0-7695-4464-9
DOI :
10.1109/BCGIn.2011.114