DocumentCode :
243646
Title :
Rényi Divergence Based Generalization for Learning of Classification Restricted Boltzmann Machines
Author :
Qian Yu ; Yuexian Hou ; Xiaozhao Zhao ; Guochen Cheng
Author_Institution :
Sch. of Comput. Software, Tianjin Univ., Tianjin, China
fYear :
2014
fDate :
14-14 Dec. 2014
Firstpage :
692
Lastpage :
697
Abstract :
As a derivative of Restricted Boltzmann Machine (RBM), classification RBM (Class RBM) is proved to be an effective classifier with a probabilistic interpretation. Several elegant learning methods/models related to Class RBM have been proposed. This paper proposes and analyzes a Rényi divergence based generalization for discriminative learning objective of Class RBM. Specifically, we extend the Conditional Log Likelihood (CLL) objective to a general learning criterion. We prove that, some existing popular training methods can be derived from this generalization, via adjusting the parameters to specific values. Intuitively, the regularization with different settings of parameters constrain the learned RBM distribution in different ways, and the parameter setting that provide a suitable distribution constraints for a particular sample set leads to the optimal performance. Moreover, we show that this generalized criterion actually extends the CLL objective with a Rényi divergence-based regularization. The uniform distribution used in this divergence-based regularization can be replaced by some sample-based distribution. This modification is applicable to any specific case of the general objective, and we call the appended loss as general margin. The proposed generalization enables an effective model selection procedure and experiments on human face recognition and document classification achieved significant performance improvement over the existing learning methods. It is also shown empirically that general margin loss is able to stabilize the parameter sensitivity and further improve the performance of the classifiers.
Keywords :
Boltzmann machines; document handling; face recognition; generalisation (artificial intelligence); learning (artificial intelligence); pattern classification; statistical distributions; CLL objective; Class RBM; Rényi divergence based generalization; Rényi divergence-based regularization; classification RBM; classification restricted Boltzmann machines; conditional log likelihood objective; discriminative learning objective; distribution constraint; document classification; general learning criterion; human face recognition; learning method; learning model; parameter sensitivity; probabilistic interpretation; sample-based distribution; training method; Databases; Educational institutions; Error analysis; Face recognition; Learning systems; Linear programming; Training; Classification RBM; Discriminative Learning; Rényi Divergence;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
Conference_Location :
Shenzhen
Print_ISBN :
978-1-4799-4275-6
Type :
conf
DOI :
10.1109/ICDMW.2014.17
Filename :
7022663
Link To Document :
بازگشت