Title :
Classification Related Manifold Dimension Estimation with Restricted Boltzmann Machine
Author :
Kezhen Teng ; Jinqiao Wang
Author_Institution :
Nat. Lab. of Pattern Recognition, Inst. of Autom., Beijing, China
Abstract :
To handle high dimensional variables in real world, especially multimedia data, dimension reduction techniques provide effective solutions for feature selection which makes the problem easy to deal in a lower dimension subspace. However, the primary problem with traditional dimension reduction method is to estimate intrinsic dimension of manifold supporting the raw data. Since all existing approaches of dimension reduction need to set the target dimension first. And for task of classification, the dimension we need maybe different from the intrinsic one, which means feature with intrinsic dimension may offer too much information for the classification task. In this paper, we propose a method to estimate classification related dimension with the help of RBM (Restricted Boltzmann Machine) and SVM classifiers. RBM is used for dimension reduction by mapping raw data to a low dimensional hidden space, and SVM classifier is used to test information preserving ability of this hidden space. An optimal dimension is selected as the minimum one that preserves all the classification related information of novel vectors. Further, a novel initialization strategy is proposed to speed up the training of RBM. These new methods are with low time consumption and memory cost compared to many other approaches of dimension estimation. Experiments and comparisons on several synthetic and realistic datasets show the superiority of the proposed method.
Keywords :
Boltzmann machines; learning (artificial intelligence); pattern classification; support vector machines; RBM training; SVM classifiers; classification related information preservation; classification-related manifold dimension estimation; dimension reduction techniques; feature selection; high-dimensional variable handling; information preserving ability test; initialization strategy; intrinsic dimension estimation; low-dimensional hidden space; lower-dimensional subspace; memory cost; multimedia data; optimal dimension selection; raw data mapping; realistic datasets; restricted Boltzmann machine; synthetic datasets; time consumption; Accuracy; Estimation; Manifolds; Support vector machine classification; Training; Vectors; classification related; dimension estimation; restricted boltzmann machine;
Conference_Titel :
Image and Graphics (ICIG), 2013 Seventh International Conference on
Conference_Location :
Qingdao
DOI :
10.1109/ICIG.2013.174