DocumentCode :
3165568
Title :
Succinct Matrix Approximation and Efficient k-NN Classification
Author :
Liu, Rong ; Shi, Yong
Author_Institution :
CAS Sch. of Math. Sci., Beijing
fYear :
2007
fDate :
28-31 Oct. 2007
Firstpage :
213
Lastpage :
222
Abstract :
This work reveals that instead of the polynomial bounds in previous literatures there exists a sharper bound of exponential form for the L2 norm of an arbitrary shaped random matrix. Based on the newly elaborated bound, a nonuniform sampling method is presented to succinctly approximate a matrix with a sparse binary one, and thus relieves the computation loads of k-NN classifier in both time and storage. The method is also pass-efficient because sampling and quantizing are combined together in a single step and the whole process can be completed within one pass over the input matrix. In the evaluations on compression ratio and reconstruction error, the sampling method exhibits impressive capability in providing succinct and tight approximations for the input matrices. The most significant finding in the classification experiment is that the k-NN classifier based on the approximation can even outperform the standard one. This provides another strong evidence for the claim that our method is especially capable in capturing intrinsic characteristics.
Keywords :
data mining; pattern classification; polynomial matrices; sampling methods; sparse matrices; arbitrary shaped random matrix; compression ratio; data mining; k-NN classification; matrix approximation; nonuniform sampling method; polynomial bounds; reconstruction error; sparse matrix; Approximation algorithms; Computer vision; Content addressable storage; Data mining; Information retrieval; Machine learning; Machine learning algorithms; Sampling methods; Sparse matrices; Symmetric matrices;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2007. ICDM 2007. Seventh IEEE International Conference on
Conference_Location :
Omaha, NE
ISSN :
1550-4786
Print_ISBN :
978-0-7695-3018-5
Type :
conf
DOI :
10.1109/ICDM.2007.41
Filename :
4470245
Link To Document :
بازگشت